Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101
TA-Lib/ta-lib-python

Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.

+963
+8.9%
11.8K
total stars
#102
efficient/cuckoofilter

A space-efficient C++ implementation of the Cuckoo filter, a probabilistic data structure for set membership testing.

+958
+1916.0%
1.0K
total stars
#103
dr5hn/countries-states-cities-database

A comprehensive database of countries, states, and cities with data in multiple formats

+947
+11.3%
9.3K
total stars
#104
GreptimeTeam/greptimedb

Open-source, cloud-native, unified observability database for metrics, logs and traces, supporting SQL/PromQL/Streaming.

+930
+18.3%
6.0K
total stars
#105
apache/flink

Apache Flink is a stream processing framework for real-time and batch data processing.

+929
+3.7%
25.8K
total stars
#106
MariaDB/server

Open-source relational database management system (RDBMS) for building data-driven applications.

+927
+14.6%
7.3K
total stars
#107
dexie/Dexie.js

Dexie.js is a minimalistic IndexedDB wrapper that simplifies offline storage and database management in web applications.

+918
+7.0%
14.1K
total stars
#108
networkx/networkx

networkx is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.

+902
+5.7%
16.7K
total stars
#109
waditu/tushare

A Python library for crawling historical data of China stocks.

+902
+6.6%
14.5K
total stars
#110
alexeygrigorev/data-science-interviews

A repository of data science interview questions and answers for developers.

+895
+10.1%
9.8K
total stars
#111
FavioVazquez/ds-cheatsheets

A comprehensive collection of data science cheatsheets for developers and data scientists.

+894
+5.8%
16.2K
total stars
#112
apache/gravitino

An open-source data catalog platform for building a high-performance, federated metadata lake.

+892
+44.7%
2.9K
total stars
#113
scylladb/scylladb

A high-performance NoSQL data store compatible with Apache Cassandra and Amazon DynamoDB.

+881
+6.1%
15.4K
total stars
#114
taosdata/TDengine

High-performance time-series database for IoT and IIoT

+856
+3.6%
24.8K
total stars
#115
debezium/debezium

An open-source framework for change data capture from various databases using Apache Kafka.

+854
+7.3%
12.5K
total stars
#116
moshi4/pyCirclize

A Python library for creating circular data visualizations like Circos plots, chord diagrams, and radar charts.

+851
+427.6%
1.1K
total stars
#117
TIBCOSoftware/snappydata

SnappyData is a memory-optimized analytics database based on Apache Spark and Apache Geode, enabling real-time stream processing, transactions, and predictive analytics.

+846
+442.9%
1.0K
total stars
#118
deepseek-ai/smallpond

A lightweight data processing framework built on DuckDB and 3FS for vibe coders working with AI tools.

+844
+20.6%
4.9K
total stars
#119
apache/fluss

Apache Fluss is a real-time streaming storage platform built for big data analytics.

+841
+86.7%
1.8K
total stars
#120
CodeCutTech/Efficient_Python_tricks_and_tools_for_data_scientists

A collection of efficient Python tricks and tools for data scientists to improve their productivity.

+837
+130.2%
1.5K
total stars
#121
ijl/orjson

Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy

+836
+11.8%
7.9K
total stars
#122
datahub-project/datahub

An open-source metadata platform for managing your data and AI stack across the enterprise.

+828
+7.7%
11.6K
total stars
#123
rgeo/rgeo

A geospatial data library for Ruby that provides a set of tools for working with geographic data.

+827
+382.9%
1.0K
total stars
#124
sqldef/sqldef

Idempotent schema management tool for MySQL, PostgreSQL, SQLite, and SQL Server databases.

+825
+37.6%
3.0K
total stars
#125
dhamaniasad/awesome-postgres

A curated list of awesome PostgreSQL software, libraries, tools and resources.

+822
+7.5%
11.7K
total stars
#126
jorgecarleitao/arrow2

A Rust library to work with the Arrow data format, without requiring the Transmute crate.

+822
+332.8%
1.1K
total stars
#127
Rockyzsu/stock

A Python library for quantitative trading and stock analysis.

+820
+12.9%
7.2K
total stars
#128
scipy/scipy

SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.

+814
+6.0%
14.5K
total stars
#129
igorbarinov/awesome-data-engineering

A curated list of data engineering tools for software developers, not focused on AI coding tools.

+814
+10.8%
8.3K
total stars
#130
Mrkuhuo/data-warehouse-learning

Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.

+806
+318.6%
1.1K
total stars
#131
rstudio/pointblank

Data quality assessment and reporting tool for data frames and database tables in R

+805
+371.0%
1.0K
total stars
#132
apple/foundationdb

FoundationDB is an open-source, distributed, transactional key-value store that provides ACID guarantees.

+802
+5.2%
16.2K
total stars
#133
citusdata/citus

Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.

+795
+6.9%
12.3K
total stars
#134
J535D165/recordlinkage

A powerful Python library for record linkage and duplicate detection in data-driven applications.

+795
+316.7%
1.0K
total stars
#135
Micro-sheep/efinance

efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.

+786
+30.5%
3.4K
total stars
#136
rqlite/rqlite

A lightweight, fault-tolerant distributed database built on SQLite, designed for high availability.

+759
+4.6%
17.3K
total stars
#137
1eez/103976

A comprehensive English word database with translations, parts of speech, and definitions for developers.

+758
+297.3%
1.0K
total stars
#138
argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

+757
+4.8%
16.5K
total stars
#139
yhat/db.py

db.py is a Python library that provides an easier way to interact with your databases.

+757
+165.3%
1.2K
total stars
#140
red-data-tools/pycall.rb

A library for calling Python functions from the Ruby language, enabling data science and ML workflows.

+754
+211.2%
1.1K
total stars
#141
xo/dbtpl

A command-line tool to generate idiomatic Go code for SQL databases across multiple database engines.

+749
+23.9%
3.9K
total stars
#142
RhetTbull/osxphotos

A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.

+748
+28.5%
3.4K
total stars
#143
farzaa/gemini-bball

This is a Python library focused on basketball analytics and data processing.

+746
+181.1%
1.2K
total stars
#144
shashankvemuri/Finance

A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.

+744
+25.8%
3.6K
total stars
#145
dbgate/dbgate

Database manager for multiple database engines, runs as desktop or web app.

+739
+12.2%
6.8K
total stars
#146
pointfreeco/sqlite-data

A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.

+735
+81.8%
1.6K
total stars
#147
eduosi/district

This repository contains data on Chinese administrative divisions, including names, pinyin, and codes.

+729
+211.9%
1.1K
total stars
#148
erikgrinaker/toydb

An educational distributed SQL database written in Rust, not focused on AI coding tools.

+723
+11.2%
7.2K
total stars
#149
paulyoder/LinqToExcel

A library that allows developers to use LINQ to retrieve data from spreadsheets and CSV files.

+712
+202.3%
1.1K
total stars
#150
cantaro86/Financial-Models-Numerical-Methods

A collection of notebooks covering quantitative finance and numerical methods in Python.

+710
+11.8%
6.7K
total stars
124...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.