Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101

facebook/rocksdb

Embeddable, persistent key-value store for fast storage with LSM design

+293

+0.9%

31.6K

total stars

C++

#102

mongodb/mongo

MongoDB database server and tools

+289

+1.0%

28.2K

total stars

C++

#103

multiprocessio/datastation

A versatile app for querying, scripting, and visualizing data from various databases, files, and APIs.

+289

+10.8%

3.0K

total stars

TypeScript

#104

qishibo/AnotherRedisDesktopManager

Redis desktop manager with GUI for managing Redis databases on Linux, Windows, Mac

+287

+0.8%

34.0K

total stars

JavaScript

#105

sqlitebrowser/sqlitebrowser

SQLite database management tool with GUI

+287

+1.2%

23.7K

total stars

C++

#106

linhandev/dataset

A comprehensive index of medical imaging datasets for researchers and developers working in the medical imaging field.

+285

+8.9%

3.5K

total stars

#107

Micro-sheep/efinance

efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.

+284

+9.2%

3.4K

total stars

Python

#108

Wisser/Jailer

A Java-based database subsetting and relational data browsing tool for popular databases.

+277

+9.7%

3.1K

total stars

Java

#109

CLUEbenchmark/CLUEDatasetSearch

A comprehensive search tool for finding Chinese NLP datasets, with support for common English NLP datasets as well.

+276

+6.7%

4.4K

total stars

Python

#110

fjall-rs/fjall

A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.

+275

+16.7%

1.9K

total stars

Rust

#111

nalgeon/redka

A Redis-compatible database implemented in Go, supporting SQL and multiple backends like PostgreSQL and SQLite.

+270

+6.3%

4.5K

total stars

#112

DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

+265

+7.7%

3.7K

total stars

Java

#113

RhetTbull/osxphotos

A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.

+264

+8.5%

3.4K

total stars

Python

#114

nalgeon/sqlean

The ultimate set of SQLite extensions for developers building applications with SQLite databases.

+261

+6.5%

4.3K

total stars

#115

ranaroussi/quantstats

Portfolio analytics library for quantitative finance, built with Python

+260

+4.0%

6.8K

total stars

Python

#116

shashankvemuri/Finance

A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.

+260

+7.7%

3.6K

total stars

Python

#117

google/leveldb

Fast key-value storage library for C++

+255

+0.7%

38.9K

total stars

C++

#118

moshi4/pyCirclize

A Python library for creating circular data visualizations like Circos plots, chord diagrams, and radar charts.

+253

+31.7%

1.1K

total stars

Python

#119

rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

+252

+2.7%

9.5K

total stars

C++

#120

canonical/dqlite

An embeddable, replicated, and fault-tolerant SQL engine for building robust and scalable applications.

+251

+6.2%

4.3K

total stars

#121

apache/hamilton

Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.

+251

+11.6%

2.4K

total stars

Jupyter Notebook

#122

timescale/pgvectorscale

A Postgres extension for high-performance vector search, complementing pgvector for scale.

+250

+9.4%

2.9K

total stars

Rust

#123

ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+246

+3.2%

7.9K

total stars

Python

#124

vortex-data/vortex

An extensible, high-performance columnar file format for data storage and processing.

+246

+9.8%

2.8K

total stars

Rust

#125

apache/datafusion

Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.

+242

+2.9%

8.5K

total stars

Rust

#126

StarRocks/starrocks

A high-performance open source query engine for sub-second analytics on data lakehouse.

+241

+2.1%

11.4K

total stars

Java

#127

jorgecarleitao/arrow2

A Rust library to work with the Arrow data format, without requiring the Transmute crate.

+241

+29.1%

1.1K

total stars

Rust

#128

hugo2046/QuantsPlaybook

A quantitative research and stock analysis platform for finance professionals.

+240

+5.6%

4.5K

total stars

Jupyter Notebook

#129

TIBCOSoftware/snappydata

SnappyData is a memory-optimized analytics database based on Apache Spark and Apache Geode, enabling real-time stream processing, transactions, and predictive analytics.

+238

+29.8%

1.0K

total stars

Scala

#130

mpquant/Ashare

A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.

+236

+8.0%

3.2K

total stars

Python

#131

rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

+236

+29.2%

1.0K

total stars

Ruby

#132

apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

+234

+1.6%

15.1K

total stars

Java

#133

waditu/tushare

A Python library for crawling historical data of China stocks.

+234

+1.6%

14.5K

total stars

Python

#134

zemirco/json2csv

Convert JSON to CSV with column titles

+233

+9.3%

2.7K

total stars

JavaScript

#135

vitessio/vitess

Distributed MySQL database system for horizontal scaling

+232

+1.1%

20.8K

total stars

#136

fluvio-community/fluvio

Fluvio is an event stream processing engine for developers to build responsive data-intensive apps.

+232

+4.7%

5.2K

total stars

Rust

#137

rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

+232

+29.4%

1.0K

total stars

#138

uiwjs/province-city-china

Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.

+230

+8.3%

3.0K

total stars

JavaScript

#139

trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

+229

+1.9%

12.6K

total stars

Java

#140

rob-med/awesome-TS-anomaly-detection

A curated list of tools and datasets for anomaly detection on time-series data.

+229

+7.8%

3.2K

total stars

#141

google/or-tools

Google's Operations Research tools for combinatorial optimization, linear programming, and operations research.

+228

+1.8%

13.2K

total stars

C++

#142

tidyverse/dplyr

dplyr is a powerful R library for data manipulation, providing a grammar of data manipulation.

+228

+4.8%

5.0K

total stars

#143

Mrkuhuo/data-warehouse-learning

Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.

+227

+27.3%

1.1K

total stars

Java

#144

PyWavelets/pywt

PyWavelets is a Python library for wavelet transform algorithms and techniques, useful for image and signal processing.

+226

+10.7%

2.3K

total stars

Python

#145

J535D165/recordlinkage

A powerful Python library for record linkage and duplicate detection in data-driven applications.

+226

+27.6%

1.0K

total stars

Python

#146

jtablesaw/tablesaw

A high-performance Java library for data analysis, visualization, and machine learning.

+225

+6.4%

3.7K

total stars

Java

#147

upper/db

A data access layer (DAL) and ORM-like library for working with SQL and NoSQL databases in Go.

+221

+6.5%

3.6K

total stars

#148

apache/arrow

Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.

+219

+1.3%

16.6K

total stars

C++

#149

apache/iceberg

Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.

+218

+2.6%

8.6K

total stars

Java

#150

dlt-hub/dlt

An open-source Python library that simplifies the process of loading data into data lakes and warehouses.

+217

+4.5%

5.0K

total stars

Python

1 24...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.