Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101
google/leveldb

Fast key-value storage library for C++

+75
+0.2%
38.9K
total stars
#102
treeverse/dvc

dvc is a data versioning and ML experiments tool that helps developers manage and track data and model changes.

+75
+0.5%
15.4K
total stars
#103
dr5hn/countries-states-cities-database

A comprehensive database of countries, states, and cities with data in multiple formats

+75
+0.8%
9.3K
total stars
#104
Rockyzsu/stock

A Python library for quantitative trading and stock analysis.

+74
+1.0%
7.2K
total stars
#105
GreptimeTeam/greptimedb

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

+74
+1.3%
6.0K
total stars
#106
ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+72
+0.9%
7.9K
total stars
#107
youssefHosni/Data-Science-Interview-Questions-Answers

A curated list of data science interview questions and answers for developers.

+72
+1.3%
5.5K
total stars
#108
typeorm/typeorm

ORM for TypeScript and JavaScript with support for multiple databases and platforms.

+70
+0.2%
36.4K
total stars
#109
apache/flink

Apache Flink is a stream processing framework for real-time and batch data processing.

+70
+0.3%
25.8K
total stars
#110
scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

+70
+0.5%
15.4K
total stars
#111
PRQL/prql

PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.

+70
+0.7%
10.7K
total stars
#112
simonw/datasette

An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.

+69
+0.6%
10.8K
total stars
#113
theOehrly/Fast-F1

A Python package for accessing and analyzing Formula 1 racing data, including results, schedules, timing, and telemetry.

+69
+1.6%
4.5K
total stars
#114
duckdb/ducklake

DuckLake is an integrated data lake and catalog format written in C++.

+69
+2.8%
2.5K
total stars
#115
pingcap/awesome-database-learning

A comprehensive list of learning materials to help developers understand database internals.

+68
+0.6%
10.7K
total stars
#116
oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

+65
+0.5%
14.3K
total stars
#117
dexie/Dexie.js

Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.

+65
+0.5%
14.1K
total stars
#118
tcgoetz/GarminDB

A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.

+64
+2.2%
2.9K
total stars
#119
redis/go-redis

Redis client for Go with support for Redis 8.0+

+63
+0.3%
22.0K
total stars
#120
zhu-xlab/GlobalBuildingAtlas

GlobalBuildingAtlas is an open global and complete dataset of building polygons, heights and LoD1 3D models.

+61
+3.2%
2.0K
total stars
#121
dgraph-io/badger

Fast, embeddable key-value database written in Go for building high-performance storage applications.

+60
+0.4%
15.5K
total stars
#122
PeerDB-io/peerdb

Fast, cost-effective data replication tool from Postgres to data warehouses, queues, and storage

+60
+2.0%
3.0K
total stars
#123
argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

+59
+0.4%
16.5K
total stars
#124
zvtvz/zvt

A modular quantitative trading framework for algorithmic trading, backtesting, and financial analysis.

+58
+1.5%
4.0K
total stars
#125
dathere/qsv

Blazing-fast data wrangling toolkit for AI and data engineering workflows

+58
+1.7%
3.5K
total stars
#126
moj-analytical-services/splink

Fast, accurate, and scalable probabilistic data linkage with support for multiple SQL backends.

+58
+3.0%
2.0K
total stars
#127
pointfreeco/sqlite-data

A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.

+58
+3.7%
1.6K
total stars
#128
apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

+57
+0.3%
16.2K
total stars
#129
cozodb/cozo

A transactional, relational-graph-vector database that uses Datalog for query, designed for AI and ML use cases.

+57
+1.5%
3.9K
total stars
#130
garden-co/jazz

A distributed database with CRDT sync, offline support, and end-to-end encryption for vibe coders.

+57
+2.4%
2.5K
total stars
#131
treeverse/lakeFS

lakeFS is a Git-like version control system for data lakes, enabling data engineers to manage data versioning and data quality.

+56
+1.1%
5.2K
total stars
#132
vesoft-inc/nebula

Nebula is a fast, open-source, distributed graph database with horizontal scalability and high availability.

+55
+0.5%
12.1K
total stars
#133
drivendataorg/cookiecutter-data-science

A flexible and standardized cookiecutter template for doing and sharing data science work in Python.

+55
+0.6%
9.7K
total stars
#134
PyPortfolio/PyPortfolioOpt

A Python library for financial portfolio optimization, including classical efficient frontier and advanced techniques.

+55
+1.0%
5.5K
total stars
#135
dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

+53
+0.8%
6.8K
total stars
#136
statsmodels/statsmodels

Statsmodels is a Python library for statistical modeling and econometrics, providing tools for data analysis and prediction.

+52
+0.5%
11.3K
total stars
#137
databendlabs/databend

Unified cloud-native data warehouse platform for analytics, search and AI, built on top of S3 storage.

+52
+0.6%
9.2K
total stars
#138
taosdata/TDengine

High-performance time-series database for IoT and IIoT

+51
+0.2%
24.8K
total stars
#139
microsoft/sql-server-samples

This repository contains code samples for SQL Server, Azure SQL, and related data services from Microsoft.

+50
+0.5%
10.9K
total stars
#140
alexeygrigorev/data-science-interviews

A repository of data science interview questions and answers for developers.

+50
+0.5%
9.8K
total stars
#141
pubkey/rxdb

Reactive, local-first database for JavaScript apps with real-time sync and flexible storage

+49
+0.2%
23.1K
total stars
#142
rqlite/rqlite

A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.

+49
+0.3%
17.3K
total stars
#143
mattn/go-sqlite3

A lightweight SQLite3 driver for Go that implements the database/sql interface.

+49
+0.6%
9.0K
total stars
#144
veb-101/Data-Science-Projects

A collection of data science projects in Python using Jupyter Notebook.

+49
+2.0%
2.6K
total stars
#145
mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

+49
+4.6%
1.1K
total stars
#146
cstack/db_tutorial

A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.

+48
+0.5%
10.3K
total stars
#147
plotters-rs/plotters

A high-quality, cross-platform data plotting library for Rust developers, including WebAssembly support.

+48
+1.1%
4.5K
total stars
#148
TobikoData/sqlmesh

Scalable and efficient data transformation framework with backwards compatibility for dbt.

+48
+1.7%
2.9K
total stars
#149
elastic/kibana

Kibana is an open-source data visualization and management tool for Elasticsearch

+47
+0.2%
21.0K
total stars
#150
delta-io/delta

An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.

+46
+0.5%
8.6K
total stars
124...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.