Trending Projects

Discover the fastest-growing open source projects

Showing 201-250 of 897 trending projects

#201
mattn/go-sqlite3

A lightweight SQLite3 driver for Go that implements the database/sql interface.

+513
+6.0%
9.0K
total stars
#202
DataBrewery/cubes

A lightweight Python OLAP framework for multi-dimensional data analysis and reporting.

+510
+52.6%
1.5K
total stars
#203
skyzh/mini-lsm

A Rust-based implementation of an LSM-tree storage engine that developers can build and learn from.

+507
+14.8%
3.9K
total stars
#204
rapidsai/cudf

A high-performance GPU DataFrame library for data analysis and machine learning workloads.

+506
+5.6%
9.5K
total stars
#205
orioledb/orioledb

OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.

+501
+14.3%
4.0K
total stars
#206
iamseancheney/python_for_data_analysis_2nd_chinese_version

A Chinese translation of a popular book on using Python for data analysis with libraries like pandas and NumPy.

+497
+6.0%
8.8K
total stars
#207
pawelsalawa/sqlitestudio

A free, open-source SQLite database manager for multiple platforms.

+493
+8.3%
6.4K
total stars
#208
Data-Centric-AI-Community/ydata-profiling

A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.

+492
+3.8%
13.4K
total stars
#209
dunwu/db-tutorial

An in-depth tutorial covering mainstream database knowledge for backend developers.

+490
+10.2%
5.3K
total stars
#210
IQSS/dataverse

Open source research data repository software built with Java.

+485
+89.7%
1.0K
total stars
#211
valeriansaliou/sonic

Fast, lightweight search backend alternative to Elasticsearch.

+483
+2.3%
21.2K
total stars
#212
apache/shardingsphere

Distributed SQL database middleware for sharding, scalability, and security.

+482
+2.4%
20.7K
total stars
#213
microsoft/sql-server-samples

This repository contains code samples for SQL Server, Azure SQL, and related data services from Microsoft.

+477
+4.6%
10.9K
total stars
#214
aimeos/upscheme

A database migration and schema management tool for PHP developers, supporting multiple database engines.

+472
+22.1%
2.6K
total stars
#215
knex/knex

SQL query builder for multiple databases.

+470
+2.4%
20.2K
total stars
#216
sequelize/sequelize

ORM for Node.js/TypeScript with multiple database support.

+467
+1.6%
30.3K
total stars
#217
OSGeo/gdal

GDAL is an open-source library for working with various geospatial data formats, useful for remote sensing and GIS applications.

+462
+8.7%
5.8K
total stars
#218
pudo/dataset

Easy-to-use data handling for SQL data stores with support for implicit table creation, bulk loading, and transactions.

+460
+10.5%
4.9K
total stars
#219
RedisTimeSeries/RedisTimeSeries

A Redis module that provides a time series data structure for storing and querying time series data.

+458
+75.3%
1.1K
total stars
#220
enhancedformysql/The-Art-of-Problem-Solving-in-Software-Engineering_How-to-Make-MySQL-Better

This repository provides a comprehensive guide on optimizing MySQL performance and solving common database problems.

+456
+31.1%
1.9K
total stars
#221
LAStools/LAStools

This repository contains efficient tools for LiDAR processing, focused on working with point cloud data.

+455
+78.0%
1.0K
total stars
#222
synthetichealth/synthea

Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.

+453
+17.7%
3.0K
total stars
#223
liuhuanyong/QASystemOnMedicalKG

A tutorial and implementation of a disease-centered medical knowledge graph and QA system.

+450
+6.7%
7.2K
total stars
#224
blaze/odo

A Python library for data migration and transformation in the Blaze project.

+450
+81.1%
1.0K
total stars
#225
Tencent/wcdb

WCDB is a cross-platform database framework developed by WeChat for Android, iOS, Linux, macOS, and Windows.

+448
+4.0%
11.7K
total stars
#226
typicode/lowdb

Lightweight local JSON database for JavaScript/TypeScript apps.

+446
+2.0%
22.5K
total stars
#227
PRQL/prql

PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.

+442
+4.3%
10.7K
total stars
#228
kedro-org/kedro

Kedro is a Python toolkit for building production-ready data science and machine learning pipelines.

+441
+4.3%
10.8K
total stars
#229
opengeospatial/geoparquet

A specification for storing geospatial vector data (point, line, polygon) in the Parquet file format, enabling efficient cloud-native geospatial data processing.

+438
+74.6%
1.0K
total stars
#230
feldera/feldera

The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.

+435
+31.5%
1.8K
total stars
#231
benbjohnson/thesecretlivesofdata

An interactive, JavaScript-based visual guide to data concepts, best known for its walkthrough of the Raft consensus algorithm.

+433
+13.6%
3.6K
total stars
#232
lakekeeper/lakekeeper

Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.

+432
+55.7%
1.2K
total stars
#233
hannorein/rebound

An open-source N-body simulation library for astrophysics and planetary science.

+432
+70.7%
1.0K
total stars
#234
cmu-db/bustub

An educational relational database management system (RDBMS) implementation in C++.

+431
+9.7%
4.9K
total stars
#235
narwhals-dev/narwhals

Lightweight and extensible compatibility layer between popular dataframe libraries like pandas, Dask, and PySpark.

+431
+38.6%
1.5K
total stars
#236
spacejam/sled

A high-performance, concurrent, embedded key-value database written in Rust.

+429
+5.0%
8.9K
total stars
#237
gunnarmorling/awesome-opensource-data-engineering

An Awesome List of open-source data engineering projects for developers.

+429
+16.5%
3.0K
total stars
#238
karlseguin/the-little-mongodb-book

A concise guide to the MongoDB NoSQL database for developers.

+424
+39.7%
1.5K
total stars
#239
petewarden/dstk

A collection of open data sets and tools for data science and machine learning tasks.

+424
+59.2%
1.1K
total stars
#240
apache/cassandra

Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.

+423
+4.6%
9.6K
total stars
#241
jorgerojas26/lazysql

A cross-platform TUI database management tool written in Go for developers working with databases.

+423
+13.5%
3.5K
total stars
#242
OpenRefine/OpenRefine

OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.

+422
+3.7%
11.8K
total stars
#243
brandon-rhodes/pycon-pandas-tutorial

A tutorial for using the popular Python data analysis library pandas, presented at PyCon 2015.

+414
+63.0%
1.1K
total stars
#244
XTXMarkets/ternfs

An exabyte-scale, multi-region distributed file system for developers building AI-powered applications.

+413
+47.3%
1.3K
total stars
#245
cstack/db_tutorial

A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.

+412
+4.2%
10.3K
total stars
#246
rilldata/rill

Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.

+403
+19.1%
2.5K
total stars
#247
liyupi/sql-mother

A free, interactive SQL learning platform with an online SQL editor, real-time query results, and syntax highlighting.

+401
+11.1%
4.0K
total stars
#248
veb-101/Data-Science-Projects

A collection of data science projects in Python using Jupyter Notebook.

+401
+18.6%
2.6K
total stars
#249
mgramin/awesome-db-tools

A curated list of awesome database tools and resources to make working with databases easier.

+400
+8.7%
5.0K
total stars
#250
Jon-Becker/prediction-market-analysis

Framework for collecting and analyzing prediction market data with comprehensive Polymarket/Kalshi datasets.

+400
+23.2%
2.1K
total stars