Trending Projects

Discover the fastest growing open source projects

Showing 451-500 of 897 trending projects

#451
lk-geimfari/mimesis

Mimesis is a fast Python library for generating fake data in multiple languages for testing and development purposes.

+79
+1.7%
4.8K
total stars
#452
oracle-samples/oracle-db-examples

This repository provides code examples for Oracle's AI-enabled database features and integrations.

+79
+6.0%
1.4K
total stars
#453
ydb-platform/ydb

An open-source distributed SQL database with high availability, scalability, and ACID transactions.

+78
+1.7%
4.7K
total stars
#454
ptyadana/SQL-Data-Analysis-and-Visualization-Projects

This GitHub repository contains SQL data analysis and visualization projects using various tools and databases.

+78
+4.9%
1.7K
total stars
#455
fivethirtyeight/data

A data repository for the data journalism site FiveThirtyEight, containing data and code behind their articles and graphics.

+77
+0.5%
17.3K
total stars
#456
zhisheng17/flink-learning

This is a comprehensive learning resource for the Flink stream processing framework, covering concepts, principles, and real-world use cases.

+77
+0.5%
15.1K
total stars
#457
nalgeon/sqlean

The ultimate set of SQLite extensions for developers building applications with SQLite databases.

+77
+1.8%
4.3K
total stars
#458
avhz/RustQuant

A Rust library for quantitative finance, including tools for machine learning, option pricing, and trading.

+77
+4.9%
1.7K
total stars
#459
deepseek-ai/smallpond

A lightweight data processing framework built on DuckDB and 3FS.

+76
+1.6%
4.9K
total stars
#460
tcgoetz/GarminDB

A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.

+76
+2.6%
2.9K
total stars
#461
objectbox/objectbox-go

Embedded Go Database, a fast open-source NoSQL database solution for Go projects.

+76
+6.4%
1.3K
total stars
#462
cmu-db/ottertune

An automatic DBMS configuration tool for optimizing database performance.

+76
+6.6%
1.2K
total stars
#463
ddotta/awesome-polars

A curated list of resources for Polars, an open-source, high-performance data manipulation library for Python and Rust.

+76
+7.7%
1.1K
total stars
#464
apache/pinot

Apache Pinot is a realtime distributed OLAP datastore for fast querying of large datasets.

+75
+1.3%
6.0K
total stars
#465
olric-data/olric

Olric is a distributed, in-memory key/value store and cache for Go applications and services.

+75
+2.2%
3.4K
total stars
#466
bytewax/bytewax

Bytewax is a Python library for building scalable, fault-tolerant, and low-latency data processing pipelines.

+75
+4.0%
2.0K
total stars
#467
babyfish-ct/jimmer

An advanced ORM library for Java and Kotlin developers that provides powerful caching and data management features.

+75
+4.8%
1.6K
total stars
#468
orbitinghail/graft

Graft is an open-source transactional storage engine optimized for lazy, partial, and strongly consistent replication, ideal for edge, offline-first, and distributed applications.

+75
+5.6%
1.4K
total stars
#469
dbt-labs/dbt-utils

Utility functions for dbt projects, a popular data transformation tool for data engineers.

+74
+4.5%
1.7K
total stars
#470
briatte/awesome-network-analysis

A curated list of awesome resources for network analysis and visualization, with a focus on R tools.

+73
+1.9%
4.0K
total stars
#471
MakieOrg/Makie.jl

A powerful data visualization and plotting library for the Julia programming language.

+73
+2.8%
2.7K
total stars
#472
submato/xhscrawl

A web scraping tool for collecting data from Xiaohongshu, Bilibili, and other Chinese social platforms.

+73
+6.2%
1.3K
total stars
#473
supermarin/ObjectiveRecord

ActiveRecord-like API for CoreData, a powerful object-relational mapping (ORM) for iOS development.

+72
+5.9%
1.3K
total stars
#474
samayo/country-json

A simple JSON data set of country information, useful for building apps that need country data.

+72
+6.7%
1.1K
total stars
#475
canonical/dqlite

An embeddable, replicated, and fault-tolerant SQL engine for building robust and scalable applications.

+71
+1.7%
4.3K
total stars
#476
DataLinkDC/dinky

Dinky is a real-time data development platform based on Apache Flink, enabling agile data development, deployment and operation.

+71
+1.9%
3.7K
total stars
#477
chdb-io/chdb

An in-process OLAP SQL Engine powered by ClickHouse, enabling fast and efficient data analysis.

+71
+2.8%
2.6K
total stars
#478
colour-science/colour

A comprehensive Python library for color science and color space conversions.

+71
+2.9%
2.5K
total stars
#479
oetiker/rrdtool-1.x

RRDtool is a time-series database system for efficiently storing and graphing data.

+71
+7.0%
1.1K
total stars
#480
hosseinmoein/DataFrame

C++ DataFrame library for statistical, financial, and machine learning analysis.

+70
+2.5%
2.9K
total stars
#481
mirage/irmin

Irmin is a distributed database that follows the same design principles as Git, allowing for distributed version control of data.

+70
+3.8%
1.9K
total stars
#482
pgvector/pgvector-python

A Python library that provides support for pgvector, the PostgreSQL extension for vector storage and similarity search.

+70
+5.1%
1.4K
total stars
#483
LibRaw/LibRaw

LibRaw is a C++ library for reading RAW image files from digital cameras.

+70
+5.1%
1.4K
total stars
#484
bashtage/linearmodels

This Python library provides additional linear models for statistical modeling and analysis.

+70
+7.3%
1.0K
total stars
#485
spandanb/learndb-py

A Python library that implements database internals from scratch, useful for learning database concepts.

+69
+5.5%
1.3K
total stars
#486
jtv/libpqxx

The official C++ client API for PostgreSQL, providing a high-level interface for interacting with PostgreSQL databases.

+69
+5.7%
1.3K
total stars
#487
moby/datakit

Connect processes into powerful data pipelines with a simple Git-like filesystem interface.

+69
+6.7%
1.1K
total stars
#488
gaarason/database-all

Eloquent ORM for Java 8, 11, 17, 21, 23 and Spring Boot 2.x, 3.x.

+69
+6.8%
1.1K
total stars
#489
avehtari/BDA_py_demos

Provides Bayesian data analysis demos in Python for developers interested in probabilistic modeling.

+69
+7.1%
1.0K
total stars
#490
kurrent-io/KurrentDB

KurrentDB is an event-native database designed for modern software and event-driven architectures.

+68
+1.2%
5.7K
total stars
#491
datawhalechina/competition-baseline

A collection of code examples and baselines for common data science and machine learning competitions.

+68
+1.5%
4.7K
total stars
#492
apache/cloudberry

Open-source massively parallel processing (MPP) database, an alternative to Greenplum.

+68
+6.1%
1.2K
total stars
#493
hail-is/hail

Cloud-native genomic dataframes and batch computing for bioinformatics and genetics research.

+68
+6.9%
1.1K
total stars
#494
ideawu/ssdb

SSDB is a fast NoSQL database, an alternative to Redis, with support for LevelDB and RocksDB backends.

+67
+0.8%
8.5K
total stars
#495
datalevin/datalevin

A simple, fast, and versatile Datalog database written in Clojure.

+67
+5.1%
1.4K
total stars
#496
ChawlaAvi/Daily-Dose-of-Data-Science

A collection of code snippets and tutorials for data science and data analysis in Python.

+67
+6.1%
1.2K
total stars
#497
openspout/openspout

A fast and scalable library for reading and writing spreadsheet files (CSV, XLSX, ODS) in PHP.

+67
+6.5%
1.1K
total stars
#498
paulmach/orb

A Go library with types and utilities for working with 2D geometry, geospatial data, and mapping.

+67
+6.5%
1.1K
total stars
#499
linq2db/linq2db

LINQ to database provider for .NET, supporting various database engines.

+66
+2.1%
3.2K
total stars
#500
kayak/pypika

PyPika is a Python SQL query builder that provides a readable, Pythonic syntax for constructing complex SQL queries.

+66
+2.3%
2.9K
total stars