Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451
paulvangentcom/heartrate_analysis_python

A Python package for analyzing heart rate data from PPG and ECG signals.

+1
+0.1%
1.1K
total stars
#452
pachterlab/gget

gget is a Python library that enables efficient querying of genomic reference databases like NCBI, Ensembl, and UniProt.

+1
+0.1%
1.1K
total stars
#453
paulmach/orb

A Go library with types and utilities for working with 2D geometry, geospatial data, and mapping.

+1
+0.1%
1.1K
total stars
#454
samapriya/awesome-gee-community-datasets

A community-driven catalog of geospatial datasets for use with Google Earth Engine.

+1
+0.1%
1.1K
total stars
#455
apachecn/pyda-2e-zh

A Chinese translation of the book 'Python for Data Analysis' 2nd Edition, covering NumPy, Pandas, and other data analysis tools.

+1
+0.1%
1.1K
total stars
#456
oetiker/rrdtool-1.x

RRDtool is a time-series database system for efficiently storing and graphing data.

+1
+0.1%
1.1K
total stars
#457
big-data-europe/docker-hive

This is a Docker container for running Apache Hive, a data warehousing tool for big data analysis.

+1
+0.1%
1.1K
total stars
#458
gunrock/gunrock

Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.

+1
+0.1%
1.1K
total stars
#459
the-pudding/data

A repository of open-source data sets created for stories on The Pudding, a digital publication focused on data journalism.

+1
+0.1%
1.1K
total stars
#460
hail-is/hail

Cloud-native genomic dataframes and batch computing for bioinformatics and genetics research.

+1
+0.1%
1.1K
total stars
#461
rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

+1
+0.1%
1.0K
total stars
#462
pixiedust/pixiedust

A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.

+1
+0.1%
1.0K
total stars
#463
taynaud/python-louvain

A Python library for implementing the Louvain community detection algorithm on graphs.

+1
+0.1%
1.0K
total stars
#464
tidyverse/readr

A fast and flexible R package for reading flat files (CSV, TSV, fixed-width) into R data frames.

+1
+0.1%
1.0K
total stars
#465
cyang-kth/fmm

An open-source C++ framework for fast and parallel map matching of GPS trajectories.

+1
+0.1%
1.0K
total stars
#466
rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

+1
+0.1%
1.0K
total stars
#467
mysql/mysql-connector-j

MySQL Connector/J is a JDBC driver that enables Java applications to connect to MySQL databases.

+1
+0.1%
1.0K
total stars
#468
sequelize/sequelize

ORM for Node.js/TypeScript with multiple database support

0
0.0%
30.3K
total stars
#469
alibaba/canal

MySQL binlog incremental subscription and consumption component

0
0.0%
29.6K
total stars
#470
rethinkdb/rethinkdb

Realtime NoSQL database for web apps

0
0.0%
27.0K
total stars
#471
typicode/lowdb

Lightweight local JSON database for JavaScript/TypeScript apps

0
0.0%
22.5K
total stars
#472
prestodb/presto

Presto is an open-source distributed SQL query engine for big data, allowing fast analysis of large datasets.

0
0.0%
16.7K
total stars
#473
cayleygraph/cayley

An open-source graph database written in Go, useful for building applications that require linked data and graph-based queries.

0
0.0%
15.0K
total stars
#474
realm/realm-java

Realm is a mobile database that serves as a replacement for SQLite and ORMs.

0
0.0%
11.5K
total stars
#475
mattn/go-sqlite3

A lightweight SQLite3 driver for Go that implements the database/sql interface.

0
0.0%
9.0K
total stars
#476
ricklamers/gridstudio

Grid Studio is a web-based application for data science with full integration of open source data science frameworks and languages.

0
0.0%
8.9K
total stars
#477
ideawu/ssdb

SSDB is a fast NoSQL database, an alternative to Redis, with support for leveldb and rocksdb backends.

0
0.0%
8.5K
total stars
#478
pentaho/pentaho-kettle

Pentaho Data Integration (ETL) is a Java-based tool for building data integration and ETL pipelines.

0
0.0%
8.3K
total stars
#479
allegro/bigcache

Efficient in-memory cache in Go for storing and retrieving large amounts of data.

0
0.0%
8.1K
total stars
#480
attic-labs/noms

The versioned, forkable, syncable database for developers who need a scalable, distributed data solution.

0
0.0%
7.4K
total stars
#481
kennethreitz/records

Records is a Python SQL library that makes interacting with databases more intuitive and human-friendly.

0
0.0%
7.2K
total stars
#482
erikgrinaker/toydb

An educational distributed SQL database written in Rust, not focused on AI coding tools.

0
0.0%
7.2K
total stars
#483
jvns/pandas-cookbook

Pandas Cookbook is a collection of recipes for using Python's powerful data analysis library, Pandas.

0
0.0%
7.0K
total stars
#484
hazelcast/hazelcast

Hazelcast is a high-performance, distributed in-memory data platform for real-time insights and stream processing.

0
0.0%
6.6K
total stars
#485
syndtr/goleveldb

LevelDB key/value database in Go for building high-performance data-intensive applications.

0
0.0%
6.3K
total stars
#486
pachyderm/pachyderm

Pachyderm is a data-centric pipeline and data versioning platform for building and scaling data-intensive applications.

0
0.0%
6.3K
total stars
#487
niderhoff/nlp-datasets

A curated list of free/public domain text datasets for natural language processing (NLP) tasks.

0
0.0%
6.0K
total stars
#488
apache/hbase

Apache HBase is a distributed, scalable, fault-tolerant database for large datasets built on top of HDFS.

0
0.0%
5.6K
total stars
#489
airbnb/knowledge-repo

A next-generation curated knowledge sharing platform for data scientists and other technical professionals.

0
0.0%
5.5K
total stars
#490
lux-org/lux

Automatically visualize your pandas dataframes with a single print command, enabling quick EDA.

0
0.0%
5.4K
total stars
#491
JoshClose/CsvHelper

A C# library for reading and writing CSV files, with support for a wide range of CSV file formats.

0
0.0%
5.2K
total stars
#492
jeremyevans/sequel

Sequel is a Ruby library that provides a powerful and flexible object-relational mapping (ORM) for databases.

0
0.0%
5.1K
total stars
#493
orientechnologies/orientdb

OrientDB is a versatile, multi-model DBMS that supports Graph, Document, Reactive, Full-Text, and Geospatial models.

0
0.0%
4.9K
total stars
#494
mathesar-foundation/mathesar

An open-source, self-hosted database management tool with a spreadsheet-like interface for Postgres

0
0.0%
4.9K
total stars
#495
lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

0
0.0%
4.8K
total stars
#496
amundsen-io/amundsen

Amundsen is an open-source data discovery platform for improving productivity of data analysts and engineers.

0
0.0%
4.7K
total stars
#497
datawhalechina/competition-baseline

A collection of code examples and baselines for common data science and machine learning competitions.

0
0.0%
4.7K
total stars
#498
dedupeio/dedupe

A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

0
0.0%
4.4K
total stars
#499
CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

0
0.0%
4.4K
total stars
#500
MongoEngine/mongoengine

MongoEngine is a Python Object-Document-Mapper (ODM) for working with MongoDB databases.

0
0.0%
4.4K
total stars
1...911...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.