Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101

google/leveldb

Fast key-value storage library for C++

+75

+0.2%

38.9K

total stars

C++

#102

treeverse/dvc

DVC is a data versioning and ML experiment tool that helps developers manage and track data and model changes.

+75

+0.5%

15.4K

total stars

Python

#103

dr5hn/countries-states-cities-database

A comprehensive database of countries, states, and cities with data in multiple formats

+75

+0.8%

9.3K

total stars

Python

#104

Rockyzsu/stock

A Python library for quantitative trading and stock analysis.

+74

+1.0%

7.2K

total stars

Python

#105

GreptimeTeam/greptimedb

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

+74

+1.3%

6.0K

total stars

Rust

#106

ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+72

+0.9%

7.9K

total stars

Python

#107

youssefHosni/Data-Science-Interview-Questions-Answers

A curated list of data science interview questions and answers for developers.

+72

+1.3%

5.5K

total stars

#108

typeorm/typeorm

ORM for TypeScript and JavaScript with support for multiple databases and platforms.

+70

+0.2%

36.4K

total stars

TypeScript

#109

apache/flink

Apache Flink is a stream processing framework for real-time and batch data processing.

+70

+0.3%

25.8K

total stars

Java

#110

scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

+70

+0.5%

15.4K

total stars

C++

#111

PRQL/prql

PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.

+70

+0.7%

10.7K

total stars

Rust

#112

simonw/datasette

An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.

+69

+0.6%

10.8K

total stars

Python

#113

theOehrly/Fast-F1

A Python package for accessing and analyzing Formula 1 racing data, including results, schedules, timing, and telemetry.

+69

+1.6%

4.5K

total stars

Python

#114

duckdb/ducklake

DuckLake is an integrated data lake and catalog format written in C++.

+69

+2.8%

2.5K

total stars

C++

#115

pingcap/awesome-database-learning

A comprehensive list of learning materials to help developers understand database internals.

+68

+0.6%

10.7K

total stars

#116

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

+65

+0.5%

14.3K

total stars

#117

dexie/Dexie.js

Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.

+65

+0.5%

14.1K

total stars

TypeScript

#118

tcgoetz/GarminDB

A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.

+64

+2.2%

2.9K

total stars

Python

#119

redis/go-redis

Redis client for Go with support for Redis 8.0+

+63

+0.3%

22.0K

total stars

#120

zhu-xlab/GlobalBuildingAtlas

GlobalBuildingAtlas is an open global and complete dataset of building polygons, heights and LoD1 3D models.

+61

+3.2%

2.0K

total stars

Python

#121

dgraph-io/badger

Fast, embeddable key-value database written in Go for building high-performance storage applications.

+60

+0.4%

15.5K

total stars

#122

PeerDB-io/peerdb

Fast, cost-effective data replication tool from Postgres to data warehouses, queues, and storage

+60

+2.0%

3.0K

total stars

#123

argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

+59

+0.4%

16.5K

total stars

#124

zvtvz/zvt

A modular quantitative trading framework for algorithmic trading, backtesting, and financial analysis.

+58

+1.5%

4.0K

total stars

Python

#125

dathere/qsv

Blazing-fast data wrangling toolkit for AI and data engineering workflows

+58

+1.7%

3.5K

total stars

Rust

#126

moj-analytical-services/splink

Fast, accurate, and scalable probabilistic data linkage with support for multiple SQL backends.

+58

+3.0%

2.0K

total stars

Python

#127

pointfreeco/sqlite-data

A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.

+58

+3.7%

1.6K

total stars

Swift

#128

apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

+57

+0.3%

16.2K

total stars

C++

#129

cozodb/cozo

A transactional, relational-graph-vector database that uses Datalog for query, designed for AI and ML use cases.

+57

+1.5%

3.9K

total stars

Rust

#130

garden-co/jazz

A distributed database with CRDT sync, offline support, and end-to-end encryption for vibe coders.

+57

+2.4%

2.5K

total stars

TypeScript

#131

treeverse/lakeFS

lakeFS is a Git-like version control system for data lakes, enabling data engineers to manage data versioning and data quality.

+56

+1.1%

5.2K

total stars

#132

vesoft-inc/nebula

Nebula is a fast, open-source, distributed graph database with horizontal scalability and high availability.

+55

+0.5%

12.1K

total stars

C++

#133

drivendataorg/cookiecutter-data-science

A flexible and standardized cookiecutter template for doing and sharing data science work in Python.

+55

+0.6%

9.7K

total stars

Python

#134

PyPortfolio/PyPortfolioOpt

A Python library for financial portfolio optimization, including classical efficient frontier and advanced techniques.

+55

+1.0%

5.5K

total stars

Jupyter Notebook

#135

dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

+53

+0.8%

6.8K

total stars

Svelte

#136

statsmodels/statsmodels

Statsmodels is a Python library for statistical modeling and econometrics, providing tools for data analysis and prediction.

+52

+0.5%

11.3K

total stars

Python

#137

databendlabs/databend

Unified cloud-native data warehouse platform for analytics, search and AI, built on top of S3 storage.

+52

+0.6%

9.2K

total stars

Rust

#138

taosdata/TDengine

High-performance time-series database for IoT and IIoT

+51

+0.2%

24.8K

total stars

#139

microsoft/sql-server-samples

This repository contains code samples for SQL Server, Azure SQL, and related data services from Microsoft.

+50

+0.5%

10.9K

total stars

#140

alexeygrigorev/data-science-interviews

A repository of data science interview questions and answers for developers.

+50

+0.5%

9.8K

total stars

HTML

#141

pubkey/rxdb

Reactive, local-first database for JavaScript apps with real-time sync and flexible storage

+49

+0.2%

23.1K

total stars

TypeScript

#142

rqlite/rqlite

A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.

+49

+0.3%

17.3K

total stars

#143

mattn/go-sqlite3

A lightweight SQLite3 driver for Go that implements the database/sql interface.

+49

+0.6%

9.0K

total stars

#144

veb-101/Data-Science-Projects

A collection of data science projects in Python using Jupyter Notebook.

+49

+2.0%

2.6K

total stars

Jupyter Notebook

#145

mukunku/ParquetViewer

A simple Windows desktop app for viewing and querying Apache Parquet files, a popular big data format.

+49

+4.6%

1.1K

total stars

#146

cstack/db_tutorial

A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.

+48

+0.5%

10.3K

total stars

#147

plotters-rs/plotters

A high-quality, cross-platform data plotting library for Rust developers, including WebAssembly support.

+48

+1.1%

4.5K

total stars

Rust

#148

TobikoData/sqlmesh

Scalable and efficient data transformation framework with backwards compatibility for dbt.

+48

+1.7%

2.9K

total stars

Python

#149

elastic/kibana

Kibana is an open-source data visualization and management tool for Elasticsearch

+47

+0.2%

21.0K

total stars

TypeScript

#150

delta-io/delta

An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.

+46

+0.5%

8.6K

total stars

Scala

