Trending Projects

Discover the fastest growing open source projects

Showing 51-100 of 897 trending projects

#51
KeithGalli/pandas

A Python library for data manipulation and analysis, part of the core data science toolkit.

+699
+194.2%
1.1K
total stars
#52
1nchaos/adata

Open-source, free A-share quantitative trading data platform focused on China's stock market

+694
+20.9%
4.0K
total stars
#53
sqldef/sqldef

Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.

+690
+29.7%
3.0K
total stars
#54
arpanghosh8453/garmin-grafana

A Python script to fetch Garmin health data and populate it in an InfluxDB database for visualization in Grafana.

+682
+30.6%
2.9K
total stars
#55
dagster-io/dagster

An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.

+673
+4.7%
15.1K
total stars
#56
typesense/typesense

Fast, typo-tolerant search engine for building delightful search experiences

+623
+2.5%
25.3K
total stars
#57
juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3 for big data and cloud-native applications.

+618
+4.9%
13.3K
total stars
#58
pymupdf/PyMuPDF

A high-performance Python library for data extraction, analysis, conversion and manipulation of PDF and other documents.

+618
+7.2%
9.2K
total stars
#59
facebook/rocksdb

Embeddable, persistent key-value store for fast storage with LSM design

+600
+1.9%
31.6K
total stars
#60
tursodatabase/libsql

libSQL is an open-source, open-contribution fork of SQLite, a widely used embedded database.

+593
+3.7%
16.4K
total stars
#61
opendataloader-project/opendataloader-pdf

Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.

+591
+47.5%
1.8K
total stars
#62
apache/gravitino

An open-source data catalog platform for building a high-performance, federated metadata lake.

+582
+25.2%
2.9K
total stars
#63
vortex-data/vortex

An extensible, high-performance columnar file format for data storage and processing.

+576
+26.4%
2.8K
total stars
#64
dbt-labs/dbt-core

dbt enables data analysts and engineers to transform data using software engineering practices.

+573
+4.9%
12.3K
total stars
#65
chinese-poetry/chinese-poetry

Comprehensive Chinese poetry database with JSON-formatted data for developers

+566
+1.1%
51.0K
total stars
#66
moshi4/pyCirclize

A Python library for creating circular data visualizations like Circos plots, chord diagrams, and radar charts.

+556
+112.5%
1.1K
total stars
#67
TIBCOSoftware/snappydata

SnappyData is a memory-optimized analytics database based on Apache Spark and Apache Geode, enabling real-time stream processing, transactions, and predictive analytics.

+534
+106.2%
1.0K
total stars
#68
MariaDB/server

Open-source relational database management system (RDBMS) for building data-driven applications.

+525
+7.8%
7.3K
total stars
#69
go-gorm/gorm

GORM is a developer-friendly ORM library for Golang, offering features like associations, hooks, and auto migrations.

+524
+1.3%
39.7K
total stars
#70
rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

+517
+98.3%
1.0K
total stars
#71
qishibo/AnotherRedisDesktopManager

Redis desktop manager with a GUI for managing Redis databases on Linux, Windows, and macOS

+515
+1.5%
34.0K
total stars
#72
rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

+510
+99.6%
1.0K
total stars
#73
jorgecarleitao/arrow2

A Rust library for working with the Arrow data format without using `transmute`.

+509
+90.9%
1.1K
total stars
#74
pingcap/tidb

Cloud-native distributed SQL database for modern applications

+506
+1.3%
39.9K
total stars
#75
sqlite/sqlite

Official Git mirror of the SQLite source tree, a popular and widely used embedded database engine.

+506
+5.9%
9.1K
total stars
#76
niderhoff/nlp-datasets

A curated list of free/public domain text datasets for natural language processing (NLP) tasks.

+504
+9.2%
6.0K
total stars
#77
Mrkuhuo/data-warehouse-learning

Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.

+504
+90.8%
1.1K
total stars
#78
theOehrly/Fast-F1

A Python package for accessing and analyzing Formula 1 racing data, including results, schedules, timing, and telemetry.

+502
+12.5%
4.5K
total stars
#79
dgraph-io/dgraph

High-performance distributed graph database for real-time use cases

+500
+2.4%
21.6K
total stars
#80
apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

+494
+3.4%
15.1K
total stars
#81
google/leveldb

Fast key-value storage library for C++

+488
+1.3%
38.9K
total stars
#82
J535D165/recordlinkage

A powerful Python library for record linkage and duplicate detection in data-driven applications.

+487
+87.1%
1.0K
total stars
#83
influxdata/influxdb

Time-series database for metrics & analytics

+483
+1.6%
31.4K
total stars
#84
cockroachdb/cockroach

Distributed SQL database for cloud-native apps

+482
+1.5%
32.0K
total stars
#85
sqlitebrowser/sqlitebrowser

SQLite database management tool with GUI

+461
+2.0%
23.7K
total stars
#86
apache/datafusion

Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.

+454
+5.7%
8.5K
total stars
#87
dlt-hub/dlt

An open-source Python library that simplifies the process of loading data into data lakes and warehouses.

+453
+10.0%
5.0K
total stars
#88
PostgresApp/PostgresApp

An open-source macOS application that packages PostgreSQL, providing an easy way to set up and manage a local PostgreSQL database.

+452
+6.2%
7.7K
total stars
#89
1eez/103976

A comprehensive English word database with translations, parts of speech, and definitions for developers.

+447
+79.0%
1.0K
total stars
#90
mongodb/mongo

MongoDB database server and tools

+445
+1.6%
28.2K
total stars
#91
vitessio/vitess

Distributed MySQL database system for horizontal scaling

+442
+2.2%
20.8K
total stars
#92
holistics/dbml

A database modeling language (DBML) that helps define and document database structures.

+438
+14.2%
3.5K
total stars
#93
MongoEngine/mongoengine

MongoEngine is a Python Object-Document-Mapper (ODM) for working with MongoDB databases.

+435
+11.1%
4.4K
total stars
#94
nalgeon/sqlean

The ultimate set of SQLite extensions for developers building applications with SQLite databases.

+431
+11.2%
4.3K
total stars
#95
trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

+428
+3.5%
12.6K
total stars
#96
paradedb/paradedb

A Rust-based, Elasticsearch-quality search engine for PostgreSQL, enabling fast, real-time analytics and HTAP use cases.

+426
+5.3%
8.5K
total stars
#97
StarRocks/starrocks

A high-performance open source query engine for sub-second analytics on the data lakehouse.

+419
+3.8%
11.4K
total stars
#98
Micro-sheep/efinance

efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.

+417
+14.2%
3.4K
total stars
#99
paulyoder/LinqToExcel

A library that allows developers to use LINQ to retrieve data from spreadsheets and CSV files.

+412
+63.2%
1.1K
total stars
#100
yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

+411
+51.1%
1.2K
total stars