Trending Projects

Discover the fastest growing open source projects

Showing 251-300 of 897 trending projects

#251
treeverse/lakeFS

lakeFS is a Git-like version control system for data lakes, enabling data engineers to manage data versioning and data quality.

+399
+8.3%
5.2K
total stars
#252
mpquant/MyTT

A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.

+397
+18.0%
2.6K
total stars
#253
pyvista/pyvista

A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK)

+396
+12.6%
3.5K
total stars
#254
zvtvz/zvt

A modular quantitative trading framework for algorithmic trading, backtesting, and financial analysis.

+395
+11.0%
4.0K
total stars
#255
linhandev/dataset

A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.

+395
+12.8%
3.5K
total stars
#256
Automattic/mongoose

Mongoose is a MongoDB object modeling tool for Node.js and Deno, simplifying database interactions with schemas and models.

+394
+1.5%
27.5K
total stars
#257
thinh-vu/vnstock

A beginner-friendly Python toolkit for financial data extraction, analysis, and automation.

+387
+50.3%
1.2K
total stars
#258
mybatis/mybatis-3

MyBatis SQL Mapper for Java simplifies database interactions with object mapping.

+386
+1.9%
20.4K
total stars
#259
sql-js/sql.js

A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.

+385
+2.9%
13.6K
total stars
#260
delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

+383
+13.8%
3.2K
total stars
#261
mage-ai/mage-ai

mage-ai is a Python-based platform for building, running, and managing data pipelines and integrating/transforming data.

+382
+4.6%
8.7K
total stars
#262
josonle/Coding-Now

A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.

+381
+57.3%
1.0K
total stars
#263
google/draco

Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.

+380
+5.6%
7.2K
total stars
#264
tonbo-io/tonbo

Tonbo is an embedded database for serverless and edge runtimes, optimized for offline-first and big data use cases.

+377
+33.5%
1.5K
total stars
#265
apache/arrow-rs

Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.

+376
+12.5%
3.4K
total stars
#266
intake/intake

Intake is a lightweight Python package for discovering, investigating, loading and distributing data.

+375
+54.0%
1.1K
total stars
#267
lit26/finvizfinance

A Python library for financial analysis and data scraping from the Finviz platform.

+374
+43.9%
1.2K
total stars
#268
wangzhiwubigdata/God-Of-BigData

A comprehensive collection of resources and learning materials for big data technologies like Flink, Spark, Hadoop, and Hive.

+373
+3.7%
10.4K
total stars
#269
gunrock/gunrock

Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.

+369
+52.8%
1.1K
total stars
#270
litedb-org/LiteDB

LiteDB is a lightweight, embedded NoSQL document database for .NET applications that can be used in a single data file.

+364
+4.0%
9.4K
total stars
#271
cuge1995/awesome-time-series

A curated list of resources for time series forecasting, including papers, code, and other materials.

+364
+53.7%
1.0K
total stars
#272
PeerDB-io/peerdb

Fast, cost-effective data replication tool from Postgres to data warehouses, queues, and storage

+362
+13.7%
3.0K
total stars
#273
ydb-platform/ydb

An open-source distributed SQL database with high availability, scalability, and ACID transactions.

+361
+8.3%
4.7K
total stars
#274
msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.

+359
+5.0%
7.5K
total stars
#275
apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.

+358
+12.6%
3.2K
total stars
#276
xiangyuecn/AreaCity-JsSpider-StatsGov

Comprehensive collection of city and administrative region data for China, with features like CSV export, JS code generation, and web scraping.

+355
+5.9%
6.4K
total stars
#277
Visualize-ML/Book2_Beauty-of-Data-Visualization

A collection of Jupyter Notebook files focused on data visualization and machine learning concepts.

+355
+10.9%
3.6K
total stars
#278
prestodb/presto

Presto is an open-source distributed SQL query engine for big data, allowing fast analysis of large datasets.

+352
+2.2%
16.7K
total stars
#279
probberechts/soccerdata

A Python library for scraping soccer data from various sources for sports analytics and data science.

+351
+28.3%
1.6K
total stars
#280
tidyverse/readr

A fast and flexible R package for reading flat files (CSV, TSV, fixed-width) into R data frames.

+351
+51.9%
1.0K
total stars
#281
ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data, useful for vibe coders working with AI tools.

+349
+22.7%
1.9K
total stars
#282
plotters-rs/plotters

A high-quality, cross-platform data plotting library for Rust developers, including WebAssembly support.

+344
+8.2%
4.5K
total stars
#283
golang/leveldb

The LevelDB key-value database in the Go programming language.

+343
+42.2%
1.2K
total stars
#284
pentaho/pentaho-kettle

Pentaho Data Integration (ETL) is a Java-based tool for building data integration and ETL pipelines.

+339
+4.3%
8.3K
total stars
#285
apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

+335
+4.1%
8.5K
total stars
#286
skfolio/skfolio

A Python library for portfolio optimization using scikit-learn and convex optimization techniques.

+335
+21.5%
1.9K
total stars
#287
pixiedust/pixiedust

A Python helper library for enhancing Jupyter Notebooks with data visualization and analysis capabilities.

+331
+46.7%
1.0K
total stars
#288
man-group/ArcticDB

ArcticDB is a high-performance, serverless DataFrame database for the Python data science ecosystem.

+330
+17.6%
2.2K
total stars
#289
frectonz/sql-studio

A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.

+329
+10.4%
3.5K
total stars
#290
elliotchance/orderedmap

An ordered map implementation in Go with amortized O(1) performance for common operations.

+326
+47.2%
1.0K
total stars
#291
vlcn-io/cr-sqlite

A Rust library that provides multi-writer and CRDT support for SQLite databases.

+324
+9.8%
3.6K
total stars
#292
binance/binance-public-data

A Python library to access historical market data from the Binance cryptocurrency exchange.

+321
+16.6%
2.3K
total stars
#293
OHDSI/CommonDataModel

A definition and DDLs for the OMOP Common Data Model (CDM), a data model for healthcare data.

+320
+45.6%
1.0K
total stars
#294
deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

+317
+7.6%
4.5K
total stars
#295
rhiever/Data-Analysis-and-Machine-Learning-Projects

A collection of data analysis and machine learning projects and resources for developers.

+313
+4.9%
6.6K
total stars
#296
posit-dev/great-tables

A Python library for creating easy-to-use, visually appealing data tables and summaries.

+313
+13.6%
2.6K
total stars
#297
sqldelight/sqldelight

SQLDelight - Generates type-safe Kotlin APIs from SQL, enabling easier database management in Kotlin projects.

+310
+4.8%
6.8K
total stars
#298
lerocha/chinook-database

Sample database for SQL Server, Oracle, MySQL, PostgreSQL, SQLite, DB2

+305
+14.2%
2.5K
total stars
#299
duckdb/duckdb-wasm

WebAssembly version of the DuckDB analytical database, enabling fast in-browser analytics and SQL queries.

+304
+18.7%
1.9K
total stars
#300
apache/couchdb

An open-source, scalable, and fault-tolerant NoSQL database with a focus on reliability and offline-first design.

+303
+4.6%
6.8K
total stars
1...57...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.