Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101
facebook/rocksdb

Embeddable, persistent key-value store for fast storage with LSM design

+293
+0.9%
31.6K
total stars
#102
mongodb/mongo

MongoDB database server and tools

+289
+1.0%
28.2K
total stars
#103
multiprocessio/datastation

A versatile app for querying, scripting, and visualizing data from various databases, files, and APIs.

+289
+10.8%
3.0K
total stars
#104
qishibo/AnotherRedisDesktopManager

Redis desktop manager with GUI for managing Redis databases on Linux, Windows, Mac

+287
+0.8%
34.0K
total stars
#105
sqlitebrowser/sqlitebrowser

SQLite database management tool with GUI

+287
+1.2%
23.7K
total stars
#106
linhandev/dataset

A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.

+285
+8.9%
3.5K
total stars
#107
Micro-sheep/efinance

efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.

+284
+9.2%
3.4K
total stars
#108
Wisser/Jailer

A Java-based database subsetting and relational data browsing tool for popular databases.

+277
+9.7%
3.1K
total stars
#109
CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

+276
+6.7%
4.4K
total stars
#110
fjall-rs/fjall

A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.

+275
+16.7%
1.9K
total stars
#111
nalgeon/redka

A Redis-compatible database implemented in Go, supporting SQL and multiple backends like PostgreSQL and SQLite.

+270
+6.3%
4.5K
total stars
#112
DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

+265
+7.7%
3.7K
total stars
#113
RhetTbull/osxphotos

A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.

+264
+8.5%
3.4K
total stars
#114
nalgeon/sqlean

The ultimate set of SQLite extensions for developers building applications with SQLite databases.

+261
+6.5%
4.3K
total stars
#115
ranaroussi/quantstats

Portfolio analytics library for quantitative finance, built with Python

+260
+4.0%
6.8K
total stars
#116
shashankvemuri/Finance

A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.

+260
+7.7%
3.6K
total stars
#117
google/leveldb

Fast key-value storage library for C++

+255
+0.7%
38.9K
total stars
#118
moshi4/pyCirclize

A Python library for creating circular data visualizations like Circos plots, chord diagrams, and radar charts.

+253
+31.7%
1.1K
total stars
#119
rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

+252
+2.7%
9.5K
total stars
#120
canonical/dqlite

An embeddable, replicated, and fault-tolerant SQL engine for building robust and scalable applications.

+251
+6.2%
4.3K
total stars
#121
apache/hamilton

Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.

+251
+11.6%
2.4K
total stars
#122
timescale/pgvectorscale

A Postgres extension for high-performance vector search, complementing pgvector for scale.

+250
+9.4%
2.9K
total stars
#123
ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+246
+3.2%
7.9K
total stars
#124
vortex-data/vortex

An extensible, high-performance columnar file format for data storage and processing.

+246
+9.8%
2.8K
total stars
#125
apache/datafusion

Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.

+242
+2.9%
8.5K
total stars
#126
StarRocks/starrocks

A high-performance open source query engine for sub-second analytics on data lakehouse.

+241
+2.1%
11.4K
total stars
#127
jorgecarleitao/arrow2

A Rust library to work with the Arrow data format, without requiring the Transmute crate.

+241
+29.1%
1.1K
total stars
#128
hugo2046/QuantsPlaybook

A quantitative research and stock analysis platform for finance professionals.

+240
+5.6%
4.5K
total stars
#129
TIBCOSoftware/snappydata

SnappyData is a memory-optimized analytics database based on Apache Spark and Apache Geode, enabling real-time stream processing, transactions, and predictive analytics.

+238
+29.8%
1.0K
total stars
#130
mpquant/Ashare

A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.

+236
+8.0%
3.2K
total stars
#131
rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

+236
+29.2%
1.0K
total stars
#132
apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

+234
+1.6%
15.1K
total stars
#133
waditu/tushare

A Python library for crawling historical data of China stocks.

+234
+1.6%
14.5K
total stars
#134
zemirco/json2csv

Convert JSON to CSV with column titles

+233
+9.3%
2.7K
total stars
#135
vitessio/vitess

Distributed MySQL database system for horizontal scaling

+232
+1.1%
20.8K
total stars
#136
fluvio-community/fluvio

Fluvio is an event stream processing engine for developers to build responsive data-intensive apps.

+232
+4.7%
5.2K
total stars
#137
rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

+232
+29.4%
1.0K
total stars
#138
uiwjs/province-city-china

Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.

+230
+8.3%
3.0K
total stars
#139
trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

+229
+1.9%
12.6K
total stars
#140
rob-med/awesome-TS-anomaly-detection

A curated list of tools and datasets for anomaly detection on time-series data.

+229
+7.8%
3.2K
total stars
#141
google/or-tools

Google's Operations Research tools for combinatorial optimization, linear programming, and operations research.

+228
+1.8%
13.2K
total stars
#142
tidyverse/dplyr

dplyr is a powerful R library for data manipulation, providing a grammar of data manipulation.

+228
+4.8%
5.0K
total stars
#143
Mrkuhuo/data-warehouse-learning

Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.

+227
+27.3%
1.1K
total stars
#144
PyWavelets/pywt

PyWavelets is a Python library for wavelet transform algorithms and techniques, useful for image and signal processing.

+226
+10.7%
2.3K
total stars
#145
J535D165/recordlinkage

A powerful Python library for record linkage and duplicate detection in data-driven applications.

+226
+27.6%
1.0K
total stars
#146
jtablesaw/tablesaw

A high-performance Java library for data analysis, visualization, and machine learning.

+225
+6.4%
3.7K
total stars
#147
upper/db

A data access layer (DAL) and ORM-like library for working with SQL and NoSQL databases in Go.

+221
+6.5%
3.6K
total stars
#148
apache/arrow

Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.

+219
+1.3%
16.6K
total stars
#149
apache/iceberg

Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.

+218
+2.6%
8.6K
total stars
#150
dlt-hub/dlt

An open-source Python library that simplifies the process of loading data into data lakes and warehouses.

+217
+4.5%
5.0K
total stars
124...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.