Trending Projects

Discover the fastest-growing open source projects

Showing 351-400 of 897 trending projects

#351
ron-rs/ron

A Rust library for serializing and deserializing data in the Rusty Object Notation (RON) format.

+1
+0.0%
3.9K
total stars
#352
nalepae/pandarallel

A parallel processing library for Pandas that improves performance on multi-core CPUs.

+1
+0.0%
3.8K
total stars
#353
psycopg/psycopg2

A Python database adapter for PostgreSQL, allowing developers to interact with their databases.

+1
+0.0%
3.6K
total stars
#354
Visualize-ML/Book2_Beauty-of-Data-Visualization

A collection of Jupyter Notebook files focused on data visualization and machine learning concepts.

+1
+0.0%
3.6K
total stars
#355
dtinit/data-transfer-project

The Data Transfer Project enables direct transfer of user data between online service providers.

+1
+0.0%
3.6K
total stars
#356
camelot-dev/camelot

A Python library for extracting tabular data from PDF files, useful for data processing and analysis.

+1
+0.0%
3.6K
total stars
#357
frectonz/sql-studio

A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.

+1
+0.0%
3.5K
total stars
#358
databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

+1
+0.0%
3.4K
total stars
#359
ClickHouse/clickhouse-go

A Go driver for the ClickHouse analytics database, enabling fast and efficient data processing.

+1
+0.0%
3.3K
total stars
#360
ApsaraDB/PolarDB-for-PostgreSQL

A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.

+1
+0.0%
3.1K
total stars
#361
man-group/arctic

A high-performance datastore for time series and tick data built on top of MongoDB.

+1
+0.0%
3.1K
total stars
#362
openmaptiles/openmaptiles

OpenMapTiles is an open-source vector tile schema implementation for creating custom map tiles.

+1
+0.0%
3.0K
total stars
#363
datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

+1
+0.0%
3.0K
total stars
#364
uiwjs/province-city-china

Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.

+1
+0.0%
3.0K
total stars
#365
igrigorik/gharchive.org

An open-source project that captures the public GitHub timeline and makes it accessible for analysis.

+1
+0.0%
3.0K
total stars
#366
gonum/plot

A Go library for creating high-quality plots and visualizations of data.

+1
+0.0%
2.9K
total stars
#367
TobikoData/sqlmesh

Scalable and efficient data transformation framework with backwards compatibility for dbt.

+1
+0.0%
2.9K
total stars
#368
ekzhu/datasketch

A Python library for data sketching techniques like MinHash, LSH, HyperLogLog, and HNSW for approximate similarity search.

+1
+0.0%
2.9K
total stars
#369
orbitinghail/sqlsync

A collaborative, offline-first SQLite wrapper for syncing app state across users and devices.

+1
+0.0%
2.9K
total stars
#370
MakieOrg/Makie.jl

A powerful data visualization and plotting library for the Julia programming language.

+1
+0.0%
2.7K
total stars
#371
chdb-io/chdb

An in-process OLAP SQL Engine powered by ClickHouse, enabling fast and efficient data analysis.

+1
+0.0%
2.6K
total stars
#372
Visualize-ML/Book6_First-Course-in-Data-Science

A book on data science, covering topics from basic math to machine learning using Python and Jupyter Notebooks.

+1
+0.0%
2.6K
total stars
#373
colour-science/colour

A comprehensive Python library for color science and color space conversions.

+1
+0.0%
2.5K
total stars
#374
rilldata/rill

Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.

+1
+0.0%
2.5K
total stars
#375
griddb/griddb

GridDB is a fast and scalable open-source database for time-series IoT and big data applications.

+1
+0.0%
2.5K
total stars
#376
armink/FlashDB

An ultra-lightweight database that supports key-value and time series data for embedded and IoT applications.

+1
+0.0%
2.4K
total stars
#377
benedekrozemberczki/awesome-community-detection

A curated list of community detection research papers with implementations for data science and network analysis.

+1
+0.0%
2.4K
total stars
#378
lukes/ISO-3166-Countries-with-Regional-Codes

A comprehensive dataset of ISO 3166-1 country codes and their corresponding UN Geoscheme regional codes, ready to use in various formats.

+1
+0.0%
2.4K
total stars
#379
malloydata/malloy

Malloy is an open-source language for describing data relationships and transformations.

+1
+0.0%
2.4K
total stars
#380
google/youtube-8m

Starter code for working with the YouTube-8M dataset, a large-scale video understanding dataset.

+1
+0.0%
2.4K
total stars
#381
VictoriaMetrics/fastcache

Fast in-memory cache library for Go with low GC overhead, optimized for a large number of entries.

+1
+0.0%
2.3K
total stars
#382
binance/binance-public-data

Instructions and scripts for downloading historical market data from the Binance cryptocurrency exchange.

+1
+0.0%
2.3K
total stars
#383
timeplus-io/proton

Fast, single-binary C++ SQL ETL pipeline for stream processing, observability, analytics, and AI/ML.

+1
+0.1%
2.2K
total stars
#384
RJT1990/pyflux

Open source time series library for Python, useful for statistical analysis and modeling.

+1
+0.1%
2.1K
total stars
#385
konradhalas/dacite

A simple Python library for creating dataclasses from dictionaries.

+1
+0.1%
2.0K
total stars
#386
LastAncientOne/Stock_Analysis_For_Quant

A collection of stock analysis tools across various programming languages and platforms.

+1
+0.1%
2.0K
total stars
#387
zarr-developers/zarr-python

An efficient and compressed N-dimensional array library for Python, useful for data scientists and ML engineers.

+1
+0.1%
1.9K
total stars
#388
mirage/irmin

Irmin is a distributed database that follows the same design principles as Git, allowing for distributed version control of data.

+1
+0.1%
1.9K
total stars
#389
baidu/tera

An Internet-scale distributed database system built on C++, inspired by Google's Bigtable.

+1
+0.1%
1.9K
total stars
#390
yhilpisch/py4fi

Python code for financial applications, accompanying the book Python for Finance (O'Reilly).

+1
+0.1%
1.9K
total stars
#391
data-engineering-community/data-engineering-wiki

A community-driven wiki for learning data engineering, covering topics like data modeling, pipelines, and databases.

+1
+0.1%
1.9K
total stars
#392
fluid-cloudnative/fluid

Fluid is a distributed data abstraction and acceleration framework for Big Data and AI applications on the cloud.

+1
+0.1%
1.9K
total stars
#393
johannfaouzi/pyts

A Python package for time series classification, useful for developers working with time-series data.

+1
+0.1%
1.9K
total stars
#394
neo4j-contrib/neo4j-apoc-procedures

A collection of procedures for the Neo4j graph database, providing advanced graph algorithms and utilities.

+1
+0.1%
1.9K
total stars
#395
cbailes/awesome-deep-trading

A curated list of resources for machine learning-based algorithmic trading and quantitative finance.

+1
+0.1%
1.8K
total stars
#396
plant99/felicette

A Python library for processing and visualizing satellite imagery data.

+1
+0.1%
1.8K
total stars
#397
mwaskom/seaborn-data

Example datasets used by the seaborn data visualization library for Python.

+1
+0.1%
1.8K
total stars
#398
risinglightdb/risinglight

An educational OLAP database system built in Rust for learning and experimentation.

+1
+0.1%
1.8K
total stars
#399
mkazhdan/PoissonRecon

Poisson Surface Reconstruction is a C++ library for reconstructing surfaces from point cloud data.

+1
+0.1%
1.8K
total stars
#400
zalando/spilo

A Docker image that bundles PostgreSQL with Patroni to run highly available PostgreSQL clusters.

+1
+0.1%
1.8K
total stars