Trending Projects

Discover the fastest growing open source projects

Showing 151-200 of 897 trending projects

#151
msiemens/tinydb

A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.

+46
+0.6%
7.5K
total stars
#152
rougier/scientific-visualization-book

An open-access book on scientific visualization using Python and Matplotlib for data-driven developers

+45
+0.4%
11.2K
total stars
#153
dbeaver/cloudbeaver

Cloud-based database manager UI for querying, managing, and visualizing databases across multiple platforms.

+45
+1.0%
4.7K
total stars
#154
huggingface/datatrove

A Python library that provides a set of customizable pipeline processing blocks for data processing tasks.

+45
+1.6%
2.9K
total stars
#155
mpquant/MyTT

A Python library with most common stock market technical indicators, making it easy to implement quantitative finance and algorithmic trading.

+45
+1.8%
2.6K
total stars
#156
mootdx/mootdx

A Python library for conveniently reading data from the Tongdaxin financial data platform.

+45
+3.4%
1.4K
total stars
#157
mysql/mysql-server

Open-source relational database engine powering web apps, APIs, and data-driven backends worldwide.

+44
+0.4%
12.2K
total stars
#158
pyvista/pyvista

A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK)

+44
+1.3%
3.5K
total stars
#159
tikv/tikv

Distributed transactional key-value database, originally created to complement TiDB

+43
+0.3%
16.6K
total stars
#160
cmu-db/bustub

An educational relational database management system (RDBMS) implementation in C++.

+43
+0.9%
4.9K
total stars
#161
fjall-rs/fjall

A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.

+43
+2.3%
1.9K
total stars
#162
andkret/Cookbook

A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.

+42
+0.3%
15.0K
total stars
#163
pawelsalawa/sqlitestudio

A free, open-source SQLite database manager for multiple platforms.

+42
+0.7%
6.4K
total stars
#164
meltano/meltano

Meltano is a declarative, code-first data integration engine for building and scaling data and ML-powered products.

+42
+1.8%
2.4K
total stars
#165
NateScarlet/holiday-cn

A Python tool for automatically scraping data on China's statutory holidays from government announcements.

+42
+2.3%
1.8K
total stars
#166
apache/seatunnel

A high-performance, distributed data integration tool for batch, streaming, and CDC use cases.

+41
+0.5%
9.1K
total stars
#167
frectonz/sql-studio

A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.

+41
+1.2%
3.5K
total stars
#168
Data-Centric-AI-Community/ydata-profiling

A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.

+40
+0.3%
13.4K
total stars
#169
liyupi/sql-mother

A free, interactive SQL learning platform with an online SQL editor, real-time query results, and syntax highlighting.

+40
+1.0%
4.0K
total stars
#170
wesm/msgvault

Archive, search, and analyze your entire email/chat history offline with DuckDB-powered analytics and AI queries.

+40
+3.2%
1.3K
total stars
#171
OSGeo/gdal

GDAL is an open-source library for working with various geospatial data formats, useful for remote sensing and GIS applications.

+39
+0.7%
5.8K
total stars
#172
liam-hq/liam

Automatically generates beautiful and easy-to-read ER diagrams from your database.

+39
+0.8%
4.7K
total stars
#173
deanmalmgren/textract

A Python library that provides a simple and unified interface for extracting text from any document format.

+39
+0.9%
4.5K
total stars
#174
orioledb/orioledb

OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.

+39
+1.0%
4.0K
total stars
#175
apache/arrow-rs

Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.

+39
+1.2%
3.4K
total stars
#176
binance/binance-public-data

A Python library to access historical market data from the Binance cryptocurrency exchange.

+39
+1.8%
2.3K
total stars
#177
narwhals-dev/narwhals

Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.

+39
+2.6%
1.5K
total stars
#178
rhiever/Data-Analysis-and-Machine-Learning-Projects

A collection of data analysis and machine learning projects and resources for developers.

+38
+0.6%
6.6K
total stars
#179
nullptrlabs/pgmodeler

An open-source data modeling tool designed for PostgreSQL, allowing developers to generate DDL commands visually.

+38
+1.1%
3.5K
total stars
#180
apache/parquet-format

Apache Parquet Format, a columnar data storage format used in the Apache Hadoop ecosystem.

+38
+1.7%
2.3K
total stars
#181
apache/fluss

Apache Fluss is a real-time streaming storage platform built for big data analytics.

+38
+2.1%
1.8K
total stars
#182
dbt-labs/metricflow

MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.

+38
+2.6%
1.5K
total stars
#183
CliMA/Oceananigans.jl

A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.

+38
+3.0%
1.3K
total stars
#184
apache/cassandra

Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.

+37
+0.4%
9.6K
total stars
#185
sql-js/sql.js

A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.

+36
+0.3%
13.6K
total stars
#186
google/draco

Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.

+36
+0.5%
7.2K
total stars
#187
mgramin/awesome-db-tools

A curated list of awesome database tools and resources to make working with databases easier.

+36
+0.7%
5.0K
total stars
#188
sacridini/Awesome-Geospatial

A comprehensive collection of geospatial tools and resources for data analysis, machine learning, and spatial applications.

+36
+0.8%
4.8K
total stars
#189
openmaptiles/openmaptiles

OpenMapTiles is an open-source vector tile schema implementation for creating custom map tiles.

+36
+1.2%
3.0K
total stars
#190
jvns/pandas-cookbook

Pandas Cookbook is a collection of recipes for using Python's powerful data analysis library, Pandas.

+35
+0.5%
7.0K
total stars
#191
skyzh/mini-lsm

A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.

+35
+0.9%
3.9K
total stars
#192
documentdb/documentdb

MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.

+35
+1.1%
3.2K
total stars
#193
aimeos/upscheme

A database migration and schema management tool for PHP developers, supporting multiple database engines.

+35
+1.4%
2.6K
total stars
#194
rilldata/rill

Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.

+35
+1.4%
2.5K
total stars
#195
feldera/feldera

The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.

+35
+2.0%
1.8K
total stars
#196
lakekeeper/lakekeeper

Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.

+35
+3.0%
1.2K
total stars
#197
SheetJS/sheetjs

SheetJS Spreadsheet Data Toolkit for data extraction and spreadsheet generation.

+34
+0.1%
36.2K
total stars
#198
knex/knex

SQL query builder for multiple databases

+34
+0.2%
20.2K
total stars
#199
OpenRefine/OpenRefine

OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.

+34
+0.3%
11.8K
total stars
#200
apache/beam

Apache Beam is a unified programming model for batch and streaming data processing.

+34
+0.4%
8.5K
total stars
1...35...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.