Showing 151-200 of 897 trending projects
A lightweight, document-oriented database optimized for happiness, used as a Python library or CLI.
An open-access book on scientific visualization using Python and Matplotlib for data-driven developers.
Cloud-based database manager UI for querying, managing, and visualizing databases across multiple platforms.
A Python library that provides a set of customizable pipeline processing blocks for data processing tasks.
A Python library with the most common stock market technical indicators, making it easier to implement quantitative finance and algorithmic trading.
A Python library for conveniently reading data from the Tongdaxin financial data platform.
Open-source relational database engine powering web apps, APIs, and data-driven backends worldwide.
A Python library for 3D plotting and mesh analysis using the Visualization Toolkit (VTK).
Distributed transactional key-value database, originally created to complement TiDB.
An educational relational database management system (RDBMS) implementation in C++.
A high-performance, embeddable key-value storage engine written in Rust for developers building data-intensive applications.
A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.
A free, open-source SQLite database manager for multiple platforms.
Meltano is a declarative, code-first data integration engine for building and scaling data and ML-powered products.
A Python tool for automatically scraping data on China's statutory holidays from government announcements.
A high-performance, distributed data integration tool for batch, streaming, and CDC use cases.
A SQL database explorer supporting multiple database engines like SQLite, PostgreSQL, and MySQL.
A Python library for fast, customizable, and interactive data profiling and exploratory data analysis.
A free, interactive SQL learning platform with an online SQL editor, real-time query results, and syntax highlighting.
Archive, search, and analyze your entire email/chat history offline with DuckDB-powered analytics and AI queries.
GDAL is an open-source library for working with various geospatial data formats, useful for remote sensing and GIS applications.
Automatically generates beautiful and easy-to-read ER diagrams from your database.
A Python library that provides a simple and unified interface for extracting text from any document format.
OrioleDB is a cloud-native PostgreSQL extension that solves performance and scalability challenges.
Official Rust implementation of the Apache Arrow data format for efficient data processing and storage.
A Python library to access historical market data from the Binance cryptocurrency exchange.
Lightweight and extensible compatibility layer between popular dataframe libraries like Pandas, Dask, and PySpark.
A collection of data analysis and machine learning projects and resources for developers.
An open-source data modeling tool designed for PostgreSQL, allowing developers to generate DDL commands visually.
Apache Parquet Format, a columnar data storage format used in the Apache Hadoop ecosystem.
Apache Fluss is a real-time streaming storage platform built for big data analytics.
MetricFlow allows developers to define, build, and maintain metrics in code for business intelligence and analytics.
A fast, flexible, ocean-flavored fluid dynamics library for climate and ocean modeling on CPUs and GPUs.
Apache Cassandra is a distributed, wide-column store database system designed for high availability, scalability, and performance.
A JavaScript library that allows you to run SQLite on the web, enabling local database functionality for web apps.
Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.
A curated list of awesome database tools and resources to make working with databases easier.
A comprehensive collection of geospatial tools and resources for data analysis, machine learning, and spatial applications.
OpenMapTiles is an open-source vector tile schema implementation for creating custom map tiles.
Pandas Cookbook is a collection of recipes for using Python's powerful data analysis library, Pandas.
A Rust-based implementation of an LSM-Tree storage engine (database) for developers to build and learn from.
MongoDB-compatible database engine for cloud-native and open-source workloads with scalability and performance.
A database migration and schema management tool for PHP developers, supporting multiple database engines.
Rill is a tool for transforming data sets into powerful dashboards using SQL, enabling BI-as-code.
The Feldera Incremental Computation Engine is a Rust-based library for building real-time data pipelines and materialized views.
Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.
SheetJS Spreadsheet Data Toolkit for data extraction and spreadsheet generation.
OpenRefine is a powerful data cleaning and transformation tool that helps developers work with messy data.
Apache Beam is a unified programming model for batch and streaming data processing.