Trending Projects

Discover the fastest growing open source projects

Showing 351-400 of 897 trending projects

#351

ron-rs/ron

A Rust library for serializing and deserializing data in the Rusty Object Notation (RON) format.

+0.0%

3.9K

total stars

Rust

#352

nalepae/pandarallel

A parallel processing library for Pandas that improves performance on multi-core CPUs.

+0.0%

3.8K

total stars

Python

#353

psycopg/psycopg2

A Python database adapter for PostgreSQL, allowing developers to interact with their databases.

+0.0%

3.6K

total stars

#354

Visualize-ML/Book2_Beauty-of-Data-Visualization

A collection of Jupyter Notebook files focused on data visualization and machine learning concepts.

+0.0%

3.6K

total stars

Jupyter Notebook

#355

dtinit/data-transfer-project

The Data Transfer Project enables direct transfer of user data between online service providers.

+0.0%

3.6K

total stars

Java

#356

camelot-dev/camelot

A Python library for extracting tabular data from PDF files, useful for data processing and analysis.

+0.0%

3.6K

total stars

Python

#357

frectonz/sql-studio

A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.

+0.0%

3.5K

total stars

Rust

#358

databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

+0.0%

3.4K

total stars

Python

#359

ClickHouse/clickhouse-go

A Go driver for the ClickHouse analytics database, enabling fast and efficient data processing.

+0.0%

3.3K

total stars

#360

ApsaraDB/PolarDB-for-PostgreSQL

A cloud-native PostgreSQL database developed by Alibaba Cloud for high-performance, scalable data storage and management.

+0.0%

3.1K

total stars

#361

man-group/arctic

A high-performance datastore for time series and tick data built on top of MongoDB.

+0.0%

3.1K

total stars

Python

#362

openmaptiles/openmaptiles

OpenMapTiles is an open-source vector tile schema implementation for creating custom map tiles.

+0.0%

3.0K

total stars

PLpgSQL

#363

datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

+0.0%

3.0K

total stars

Python

#364

uiwjs/province-city-china

Comprehensive dataset of China's administrative divisions (province, city, county, town) in JSON, CSV, and SQL formats.

+0.0%

3.0K

total stars

JavaScript

#365

igrigorik/gharchive.org

An open-source project that captures the public GitHub timeline and makes it accessible for analysis.

+0.0%

3.0K

total stars

Ruby

#366

gonum/plot

A Go library for creating high-quality plots and visualizations of data.

+0.0%

2.9K

total stars

#367

TobikoData/sqlmesh

Scalable and efficient data transformation framework with backwards compatibility for dbt.

+0.0%

2.9K

total stars

Python

#368

ekzhu/datasketch

A Python library for data sketching techniques like MinHash, LSH, HyperLogLog, and HNSW for approximate similarity search.

+0.0%

2.9K

total stars

Python

#369

orbitinghail/sqlsync

A collaborative, offline-first SQLite wrapper for syncing app state across users and devices.

+0.0%

2.9K

total stars

Rust

#370

MakieOrg/Makie.jl

A powerful data visualization and plotting library for the Julia programming language.

+0.0%

2.7K

total stars

Julia

#371

chdb-io/chdb

An in-process OLAP SQL Engine powered by ClickHouse, enabling fast and efficient data analysis.

+0.0%

2.6K

total stars

C++

#372

Visualize-ML/Book6_First-Course-in-Data-Science

A book on data science, covering topics from basic math to machine learning using Python and Jupyter Notebooks.

+0.0%

2.6K

total stars

Jupyter Notebook

#373

colour-science/colour

A comprehensive Python library for color science and color space conversions.

+0.0%

2.5K

total stars

Python

#374

rilldata/rill

Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.

+0.0%

2.5K

total stars

#375

griddb/griddb

GridDB is a fast and scalable open-source database for time-series IoT and big data applications.

+0.0%

2.5K

total stars

C++

#376

armink/FlashDB

An ultra-lightweight database that supports key-value and time series data for embedded and IoT applications.

+0.0%

2.4K

total stars

#377

benedekrozemberczki/awesome-community-detection

A curated list of community detection research papers with implementations for data science and network analysis.

+0.0%

2.4K

total stars

Python

#378

lukes/ISO-3166-Countries-with-Regional-Codes

A comprehensive dataset of ISO 3166-1 country codes and their corresponding UN Geoscheme regional codes, ready to use in various formats.

+0.0%

2.4K

total stars

Ruby

#379

malloydata/malloy

Malloy is an open-source language for describing data relationships and transformations.

+0.0%

2.4K

total stars

TypeScript

#380

google/youtube-8m

Starter code for working with the YouTube-8M dataset, a large-scale video understanding dataset.

+0.0%

2.4K

total stars

Python

#381

VictoriaMetrics/fastcache

Fast in-memory cache library for Go with low GC overhead, optimized for a large number of entries.

+0.0%

2.3K

total stars

#382

binance/binance-public-data

A Python library to access historical market data from the Binance cryptocurrency exchange.

+0.0%

2.3K

total stars

Python

#383

timeplus-io/proton

Fast, single-binary C++ SQL ETL pipeline for stream processing, observability, analytics, and AI/ML.

+0.1%

2.2K

total stars

C++

#384

RJT1990/pyflux

Open source time series library for Python, useful for statistical analysis and modeling.

+0.1%

2.1K

total stars

Python

#385

konradhalas/dacite

A simple Python library for creating dataclasses from dictionaries.

+0.1%

2.0K

total stars

Python

#386

LastAncientOne/Stock_Analysis_For_Quant

A collection of stock analysis tools across various programming languages and platforms.

+0.1%

2.0K

total stars

Jupyter Notebook

#387

zarr-developers/zarr-python

An efficient and compressed N-dimensional array library for Python, useful for data scientists and ML engineers.

+0.1%

1.9K

total stars

Python

#388

mirage/irmin

Irmin is a distributed database that follows the same design principles as Git, allowing for distributed version control of data.

+0.1%

1.9K

total stars

OCaml

#389

baidu/tera

An Internet-scale distributed database system built on C++, inspired by Google's Bigtable.

+0.1%

1.9K

total stars

C++

#390

yhilpisch/py4fi

Jupyter Notebooks and Python code for financial applications.

+0.1%

1.9K

total stars

Jupyter Notebook

#391

data-engineering-community/data-engineering-wiki

A community-driven wiki for learning data engineering, covering topics like data modeling, pipelines, and databases.

+0.1%

1.9K

total stars

CSS

#392

fluid-cloudnative/fluid

Fluid is a distributed data abstraction and acceleration framework for Big Data and AI applications on the cloud.

+0.1%

1.9K

total stars

#393

johannfaouzi/pyts

A Python package for time series classification, useful for developers working with time-series data.

+0.1%

1.9K

total stars

Python

#394

neo4j-contrib/neo4j-apoc-procedures

A collection of procedures for the Neo4j graph database, providing advanced graph algorithms and utilities.

+0.1%

1.9K

total stars

Java

#395

cbailes/awesome-deep-trading

A curated list of resources for machine learning-based algorithmic trading and quantitative finance.

+0.1%

1.8K

total stars

#396

plant99/felicette

A Python library for processing and visualizing satellite imagery data.

+0.1%

1.8K

total stars

Python

#397

mwaskom/seaborn-data

A repository of example datasets for the Seaborn data visualization library in Python.

+0.1%

1.8K

total stars

Python

#398

risinglightdb/risinglight

An educational OLAP database system built in Rust for learning and experimentation.

+0.1%

1.8K

total stars

Rust

#399

mkazhdan/PoissonRecon

Poisson Surface Reconstruction is a C++ library for reconstructing surfaces from point cloud data.

+0.1%

1.8K

total stars

C++

#400

zalando/spilo

Highly available PostgreSQL cluster using Docker, focused on data infrastructure for developers.

+0.1%

1.8K

total stars

Python

