Trending Projects

Discover the fastest growing open source projects

Showing 101-150 of 897 trending projects

#101
eduosi/district

This repository contains data on Chinese administrative divisions, including names, pinyin, and codes.

+410
+61.8%
1.1K
total stars
#102
shashankvemuri/Finance

A comprehensive collection of 150+ Python programs for quantitative finance and stock market data analysis.

+407
+12.6%
3.6K
total stars
#103
Jon-Becker/prediction-market-analysis

Framework for collecting and analyzing prediction market data with comprehensive Polymarket/Kalshi datasets.

+400
+23.2%
2.1K
total stars
#104
typeorm/typeorm

ORM for TypeScript and JavaScript with support for multiple databases and platforms.

+397
+1.1%
36.4K
total stars
#105
pointfreeco/sqlite-data

A fast, lightweight SQLite-based persistence layer with CloudKit synchronization for Swift developers.

+397
+32.1%
1.6K
total stars
#106
google/or-tools

Google's operations research tools for combinatorial optimization and linear programming.

+396
+3.1%
13.2K
total stars
#107
questdb/questdb

QuestDB is a high-performance, open-source, time-series database for real-time analytics and financial applications.

+390
+2.4%
16.7K
total stars
#108
ranaroussi/quantstats

Portfolio analytics library for quantitative finance, built with Python.

+390
+6.1%
6.8K
total stars
#109
treeverse/dvc

DVC is a data versioning and ML experiment tool that helps developers manage and track changes to data and models.

+389
+2.6%
15.4K
total stars
#110
alexeygrigorev/data-science-interviews

A repository of data science interview questions and answers for developers.

+389
+4.1%
9.8K
total stars
#111
BrambleXu/pydata-notebook

A collection of Jupyter Notebook files for data analysis using Python, including a Chinese translation of the popular 'Python for Data Analysis' book.

+389
+9.1%
4.7K
total stars
#112
dataprofessor/code

Compilation of R and Python programming codes for data science and machine learning projects.

+389
+60.8%
1.0K
total stars
#113
waditu/tushare

A Python library for crawling historical data on Chinese stocks.

+388
+2.7%
14.5K
total stars
#114
apache/iceberg

Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.

+388
+4.7%
8.6K
total stars
#115
XTXMarkets/ternfs

An exabyte-scale, multi-region distributed file system for developers building AI-powered applications.

+381
+42.1%
1.3K
total stars
#116
snowplow/snowplow

A powerful customer data pipeline for collecting, processing, and analyzing user events and behavior.

+379
+5.7%
7.0K
total stars
#117
gee-community/geemap

A Python package for interactive geospatial analysis and visualization with Google Earth Engine.

+376
+10.7%
3.9K
total stars
#118
alandefreitas/matplotplusplus

Matplot++: A C++ graphics library for creating high-quality data visualizations and scientific plots.

+375
+8.4%
4.8K
total stars
#119
redis-windows/redis-windows

Redis 6.0.20 through 8.0.0 for Windows, a popular open-source in-memory data structure store.

+374
+11.9%
3.5K
total stars
#120
hugo2046/QuantsPlaybook

A quantitative research and stock analysis platform for finance professionals.

+370
+8.9%
4.5K
total stars
#121
tidwall/buntdb

BuntDB is an embeddable, in-memory key/value database for Go with custom indexing and geospatial support.

+369
+8.3%
4.8K
total stars
#122
debezium/debezium

An open-source framework for change data capture from various databases using Apache Kafka.

+364
+3.0%
12.5K
total stars
#123
kuzudb/kuzu

Fast, embedded graph database with vector search and full-text search, compatible with Cypher queries.

+364
+10.8%
3.7K
total stars
#124
timescale/pgvectorscale

A Postgres extension for high-performance vector search, complementing pgvector for scale.

+360
+14.2%
2.9K
total stars
#125
Kotlin/dataframe

A Kotlin library for structured data processing, suitable for data analysis and data science tasks.

+360
+53.6%
1.0K
total stars
#126
apache/arrow

Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.

+359
+2.2%
16.6K
total stars
#127
zhu-xlab/GlobalBuildingAtlas

GlobalBuildingAtlas is an open global and complete dataset of building polygons, heights and LoD1 3D models.

+357
+21.9%
2.0K
total stars
#128
the-pudding/data

A repository of open-source data sets created for stories on The Pudding, a digital publication focused on data journalism.

+357
+51.3%
1.1K
total stars
#129
nalepae/pandarallel

A parallel processing library for Pandas that improves performance on multi-core CPUs.

+356
+10.3%
3.8K
total stars
#130
dedupeio/dedupe

A Python library for accurate and scalable fuzzy matching, record deduplication, and entity resolution.

+355
+8.7%
4.4K
total stars
#131
markwk/qs_ledger

A personal data aggregator and analysis tool for self-tracking and quantified self enthusiasts.

+350
+49.6%
1.1K
total stars
#132
apache/flink

Apache Flink is a stream processing framework for real-time and batch data processing.

+346
+1.4%
25.8K
total stars
#133
mpquant/Ashare

A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.

+343
+12.1%
3.2K
total stars
#134
rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

+342
+3.7%
9.5K
total stars
#135
opengeos/streamlit-geospatial

A multi-page Streamlit app for geospatial data visualization and analysis, useful for housing and real estate applications.

+342
+51.2%
1.0K
total stars
#136
upper/db

A data access layer (DAL) and ORM-like library for working with SQL and NoSQL databases in Go.

+339
+10.3%
3.6K
total stars
#137
igorbarinov/awesome-data-engineering

A curated list of data engineering tools for software developers.

+337
+4.2%
8.3K
total stars
#138
xo/dbtpl

A command-line tool to generate idiomatic Go code for SQL databases across multiple database engines.

+335
+9.4%
3.9K
total stars
#139
taynaud/python-louvain

A Python library for implementing the Louvain community detection algorithm on graphs.

+335
+47.7%
1.0K
total stars
#140
nutsdb/nutsdb

A simple, fast, and embeddable key-value store written in Go that supports transactions and data structures.

+334
+10.3%
3.6K
total stars
#141
TA-Lib/ta-lib-python

Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.

+333
+2.9%
11.8K
total stars
#142
RhetTbull/osxphotos

A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.

+333
+11.0%
3.4K
total stars
#143
networkx/networkx

NetworkX is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.

+329
+2.0%
16.7K
total stars
#144
dr5hn/countries-states-cities-database

A comprehensive database of countries, states, and cities with data in multiple formats.

+328
+3.6%
9.3K
total stars
#145
apache/cassandra

Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.

+325
+3.5%
9.6K
total stars
#146
documentdb/documentdb

MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.

+325
+11.2%
3.2K
total stars
#147
dhamaniasad/awesome-postgres

A curated list of awesome PostgreSQL software, libraries, tools and resources.

+321
+2.8%
11.7K
total stars
#148
datahub-project/datahub

An open-source metadata platform for managing your data and AI stack across the enterprise.

+318
+2.8%
11.6K
total stars
#149
fjall-rs/fjall

A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.

+318
+19.8%
1.9K
total stars
#150
scipy/scipy

SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.

+314
+2.2%
14.5K
total stars