Trending Projects

Discover the fastest growing open source projects

Showing 401-450 of 897 trending projects

#401
posit-dev/great-tables

A Python library for creating easy-to-use, visually appealing data tables and summaries.

+90
+3.6%
2.6K
total stars
#402
narwhals-dev/narwhals

Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.

+90
+6.2%
1.5K
total stars
#403
tidwall/btree

A high-performance B-tree implementation for Go, useful for building database-like applications.

+90
+8.1%
1.2K
total stars
#404
intake/intake

Intake is a lightweight Python package for discovering, investigating, loading and distributing data.

+90
+9.2%
1.1K
total stars
#405
gunnarmorling/awesome-opensource-data-engineering

An Awesome List of open-source data engineering projects for developers.

+89
+3.0%
3.0K
total stars
#406
mpquant/MyTT

A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.

+89
+3.5%
2.6K
total stars
#407
binance/binance-public-data

Documentation and helper scripts for downloading historical market data from the Binance cryptocurrency exchange.

+89
+4.1%
2.3K
total stars
#408
beamandrew/medical-data

No description provided for this medical data repository.

+88
+1.5%
6.0K
total stars
#409
geekinglcq/CDCS

A collection of solutions to Chinese data competitions, primarily using Python.

+88
+5.2%
1.8K
total stars
#410
pentaho/mondrian

Mondrian is an OLAP server that enables real-time analysis of large data sets for business users.

+88
+8.2%
1.2K
total stars
#411
OHDSI/CommonDataModel

A definition and DDLs for the OMOP Common Data Model (CDM), a data model for healthcare data.

+88
+9.4%
1.0K
total stars
#412
google/draco

Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.

+87
+1.2%
7.2K
total stars
#413
gunrock/gunrock

Programmable CUDA/C++ GPU Graph Analytics library for high-performance parallel graph processing.

+87
+8.9%
1.1K
total stars
#414
electricitymaps/electricitymaps-contrib

An open-source repository for parsing electricity data and powering a comprehensive electricity data platform.

+85
+2.2%
4.0K
total stars
#415
orium/rpds

A Rust library that provides persistent data structures for efficient and immutable data management.

+85
+5.3%
1.7K
total stars
#416
erikgrinaker/toydb

An educational distributed SQL database written in Rust.

+84
+1.2%
7.2K
total stars
#417
karlseguin/the-little-mongodb-book

A concise guide to the MongoDB NoSQL database for developers.

+84
+6.0%
1.5K
total stars
#418
nfstream/nfstream

NFStream is a flexible network data analysis framework for network monitoring, security, and traffic classification.

+84
+7.5%
1.2K
total stars
#419
josonle/Coding-Now

A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.

+84
+8.7%
1.0K
total stars
#420
cuge1995/awesome-time-series

A curated list of resources for time series forecasting, including papers, code, and other materials.

+84
+8.8%
1.0K
total stars
#421
skyzh/mini-lsm

A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.

+83
+2.2%
3.9K
total stars
#422
delight-im/FreeGeoDB

A free database of geographic place names and corresponding geospatial data for developers to use.

+83
+5.6%
1.6K
total stars
#423
huangzhibiao/BGFMDB

A simple Objective-C library that provides a one-line CRUD interface for SQLite databases on iOS.

+82
+6.0%
1.4K
total stars
#424
movingpandas/movingpandas

A Python library for analyzing movement trajectory data using GeoPandas.

+82
+6.3%
1.4K
total stars
#425
slashbase/slashbaseide

Modern database IDE for dev & data workflows, supporting MySQL, PostgreSQL & MongoDB.

+82
+6.7%
1.3K
total stars
#426
lakekeeper/lakekeeper

Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.

+82
+7.3%
1.2K
total stars
#427
spacejam/sled

A high-performance, concurrent, embedded key-value database written in Rust.

+81
+0.9%
8.9K
total stars
#428
ContextLab/hypertools

A Python toolbox for gaining geometric insights into high-dimensional data.

+80
+4.4%
1.9K
total stars
#429
huachaohuang/awesome-dbdev

A curated list of awesome materials and resources for database development.

+80
+5.3%
1.6K
total stars
#430
scijs/ndarray

A JavaScript library for working with multidimensional arrays, useful for data visualization and scientific computing.

+80
+6.9%
1.2K
total stars
#431
re-data/re-data

A data quality and observability tool for monitoring and fixing data issues before they become problems.

+79
+5.3%
1.6K
total stars
#432
redisson/redisson

Redisson is a Java client for Redis and Valkey with distributed objects and services.

+78
+0.3%
24.3K
total stars
#433
apache/arrow-rs

Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.

+78
+2.4%
3.4K
total stars
#434
dask/dask

Dask is a Python library for parallel computing and distributed data processing, providing a scalable alternative to NumPy and Pandas.

+77
+0.6%
13.8K
total stars
#435
cmu-db/noisepage

Self-driving database management system from Carnegie Mellon University.

+77
+4.6%
1.8K
total stars
#436
PyTables/PyTables

A powerful Python package to manage and work with extremely large amounts of data.

+77
+6.0%
1.4K
total stars
#437
kevwan/go-stash

A high-performance, open-source data processing pipeline for ingesting Kafka data and sending it to Elasticsearch.

+77
+6.8%
1.2K
total stars
#438
farzaa/gemini-bball

This is a Python library focused on basketball analytics and data processing.

+77
+7.1%
1.2K
total stars
#439
knex/knex

SQL query builder for multiple databases.

+76
+0.4%
20.2K
total stars
#440
cstack/db_tutorial

A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.

+76
+0.7%
10.3K
total stars
#441
pyvista/pyvista

A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK).

+76
+2.2%
3.6K
total stars
#442
tcgoetz/GarminDB

A Python library for downloading, parsing, and analyzing health data from Garmin, FitBit, and MS Health.

+76
+2.6%
2.9K
total stars
#443
Tencent/paxosstore

PaxosStore is a high-performance, distributed database solution built for large-scale applications.

+76
+4.6%
1.7K
total stars
#444
CamDavidsonPilon/lifetimes

A Python library for calculating customer lifetime value metrics and cohort analysis.

+76
+5.4%
1.5K
total stars
#445
PyO3/rust-numpy

Rust-based bindings for the NumPy C-API, enabling developers to leverage Rust for numerical computing.

+76
+6.0%
1.3K
total stars
#446
pomber/covid19

A public dataset of daily COVID-19 cases and deaths per country, useful for data analysis and visualization.

+76
+6.7%
1.2K
total stars
#447
sql-js/sql.js

A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.

+75
+0.6%
13.6K
total stars
#448
VictoriaMetrics/fastcache

Fast in-memory cache library for Go with low GC overhead, optimized for a large number of entries.

+75
+3.3%
2.3K
total stars
#449
wainshine/Company-Names-Corpus

A corpus of company names, abbreviations, and brands that can be used for Chinese text segmentation and entity recognition.

+75
+6.2%
1.3K
total stars
#450
elliotchance/orderedmap

An ordered map implementation in Go with amortized O(1) performance for common operations.

+75
+8.0%
1.0K
total stars
