Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101
tcgoetz/GarminDB

A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.

+6
+0.2%
2.9K
total stars
#102
apache/fluss

Apache Fluss is a real-time streaming storage platform built for big data analytics.

+6
+0.3%
1.8K
total stars
#103
avhz/RustQuant

A Rust library for quantitative finance, including tools for machine learning, option pricing, and trading.

+6
+0.4%
1.7K
total stars
#104
networkx/networkx

networkx is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.

+5
+0.0%
16.7K
total stars
#105
apache/arrow

Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.

+5
+0.0%
16.6K
total stars
#106
dgraph-io/badger

Fast, embeddable key-value database written in Go for building high-performance storage applications.

+5
+0.0%
15.5K
total stars
#107
dagster-io/dagster

An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.

+5
+0.0%
15.1K
total stars
#108
citusdata/citus

Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.

+5
+0.0%
12.3K
total stars
#109
vesoft-inc/nebula

Nebula is a fast, open-source, distributed graph database with horizontal scalability and high availability.

+5
+0.0%
12.1K
total stars
#110
statsmodels/statsmodels

Statsmodels is a Python library for statistical modeling and econometrics, providing tools for data analysis and prediction.

+5
+0.0%
11.3K
total stars
#111
alexeygrigorev/data-science-interviews

A repository of data science interview questions and answers for developers.

+5
+0.1%
9.8K
total stars
#112
hugo2046/QuantsPlaybook

A quantitative research and stock analysis platform for finance professionals.

+5
+0.1%
4.5K
total stars
#113
shashankvemuri/Finance

A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.

+5
+0.1%
3.6K
total stars
#114
jorgerojas26/lazysql

A cross-platform TUI database management tool written in Go for developers working with databases.

+5
+0.1%
3.5K
total stars
#115
documentdb/documentdb

MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.

+5
+0.2%
3.2K
total stars
#116
MIT-LCP/mimic-code

Open-source repository for sharing code related to the MIMIC family of critical care databases.

+5
+0.2%
3.1K
total stars
#117
apache/gravitino

An open-source data catalog platform for building a high-performance, federated metadata lake.

+5
+0.2%
2.9K
total stars
#118
mpquant/MyTT

A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.

+5
+0.2%
2.6K
total stars
#119
veb-101/Data-Science-Projects

A collection of data science projects in Python using Jupyter Notebook.

+5
+0.2%
2.6K
total stars
#120
duckdb/ducklake

DuckLake is an integrated data lake and catalog format written in C++.

+5
+0.2%
2.5K
total stars
#121
supabase/etl

A real-time Postgres data replication and streaming library built in Rust for building CDC pipelines.

+5
+0.2%
2.2K
total stars
#122
moj-analytical-services/splink

Fast, accurate, and scalable probabilistic data linkage with support for multiple SQL backends.

+5
+0.3%
2.0K
total stars
#123
fjall-rs/fjall

A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.

+5
+0.3%
1.9K
total stars
#124
probberechts/soccerdata

A Python library for scraping soccer data from various sources for sports analytics and data science.

+5
+0.3%
1.6K
total stars
#125
pgvector/pgvector-python

A Python library that provides support for the pgvector vector database, enabling efficient vector search and storage.

+5
+0.3%
1.4K
total stars
#126
bruin-data/bruin

A data platform that enables building data pipelines with SQL, Python, and ingesting from various sources.

+5
+0.3%
1.4K
total stars
#127
Azure/AzurePublicDataset

Azure/AzurePublicDataset is a repository containing Microsoft Azure Traces, a Jupyter Notebook-based resource.

+5
+0.5%
1.1K
total stars
#128
knex/knex

SQL query builder for multiple databases

+4
+0.0%
20.2K
total stars
#129
apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

+4
+0.0%
16.2K
total stars
#130
scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

+4
+0.0%
15.4K
total stars
#131
scipy/scipy

SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.

+4
+0.0%
14.5K
total stars
#132
Data-Centric-AI-Community/ydata-profiling

A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.

+4
+0.0%
13.4K
total stars
#133
juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3 for big data and cloud-native applications.

+4
+0.0%
13.3K
total stars
#134
trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

+4
+0.0%
12.6K
total stars
#135
pingcap/awesome-database-learning

A comprehensive list of learning materials to help developers understand database internals.

+4
+0.0%
10.7K
total stars
#136
apache/seatunnel

A high-performance, distributed data integration tool for batch, streaming, and CDC use cases.

+4
+0.0%
9.1K
total stars
#137
mage-ai/mage-ai

mage-ai is a Python-based platform for building, running, and managing data pipelines and integrating/transforming data.

+4
+0.1%
8.7K
total stars
#138
igorbarinov/awesome-data-engineering

A curated list of data engineering tools for software developers, not focused on AI coding tools.

+4
+0.1%
8.3K
total stars
#139
msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.

+4
+0.1%
7.5K
total stars
#140
dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

+4
+0.1%
6.8K
total stars
#141
sacridini/Awesome-Geospatial

A comprehensive collection of geospatial tools and resources for data analysis, machine learning, and spatial applications.

+4
+0.1%
4.8K
total stars
#142
plotters-rs/plotters

A high-quality, cross-platform data plotting library for Rust developers, including WebAssembly support.

+4
+0.1%
4.5K
total stars
#143
theOehrly/Fast-F1

A Python package for accessing and analyzing Formula 1 racing data, including results, schedules, timing, and telemetry.

+4
+0.1%
4.5K
total stars
#144
deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

+4
+0.1%
4.5K
total stars
#145
zvtvz/zvt

A modular quantitative trading framework for algorithmic trading, backtesting, and financial analysis.

+4
+0.1%
4.0K
total stars
#146
skyzh/mini-lsm

A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.

+4
+0.1%
3.9K
total stars
#147
RoaringBitmap/RoaringBitmap

A high-performance compressed bitset library for Java used in Apache Spark, Netflix Atlas, and others.

+4
+0.1%
3.8K
total stars
#148
pyvista/pyvista

A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK)

+4
+0.1%
3.5K
total stars
#149
sqldef/sqldef

Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.

+4
+0.1%
3.0K
total stars
#150
arpanghosh8453/garmin-grafana

A Python script to fetch Garmin health data and populate it in an InfluxDB database for visualization in Grafana.

+4
+0.1%
2.9K
total stars
124...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.