Showing 51-100 of 897 trending projects
Distributed key-value store for critical distributed system data
Portfolio analytics library for quantitative finance, built with Python
Official Git mirror of the SQLite source tree, a widely used embedded database engine.
Fast, embedded graph database with vector search and full-text search, compatible with Cypher queries.
Open-source relational database management system (RDBMS) for building data-driven applications.
A cross-platform TUI database management tool written in Go.
dbt enables data analysts and engineers to transform data using software engineering practices.
High-performance distributed graph database for real-time use cases
libSQL is an open-source, open-contribution fork of SQLite, a widely used embedded database.
An open-source data catalog platform for building a high-performance, federated metadata lake.
A Python library for crawling historical data on Chinese stocks.
A Python script to fetch Garmin health data and populate it in an InfluxDB database for visualization in Grafana.
A free, open-source Python library for fetching real-time stock data from Chinese stock exchanges.
JuiceFS is a distributed POSIX file system built on top of Redis and S3 for big data and cloud-native applications.
An open-source Python library that simplifies the process of loading data into data lakes and warehouses.
Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.
A quantitative research and stock analysis platform for finance professionals.
Redis desktop manager with a GUI for managing Redis databases on Linux, Windows, and macOS
An open-source framework for change data capture from various databases using Apache Kafka.
A Python library for scraping soccer data from various sources for sports analytics and data science.
Google's Operations Research tools for combinatorial optimization and linear programming.
A Python library that helps ensure data quality and reliability through data profiling and testing.
A Python library to programmatically access and manage photos and metadata in the Apple Photos library on macOS.
networkx is a Python library for creating, manipulating, and studying the structure and dynamics of complex networks.
Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.
Embeddable, persistent key-value store for fast storage with LSM design
A curated list of data engineering tools for software developers.
Windows builds of Redis (versions 6.0.20 through 8.0.0), a popular open-source in-memory data structure store.
QuestDB is a high-performance, open-source, time-series database for real-time analytics and financial applications.
A high-performance open-source query engine for sub-second analytics on the data lakehouse.
Comprehensive Chinese poetry database with JSON-formatted data for developers
A Redis-compatible database implemented in Go, supporting SQL and multiple backends like PostgreSQL and SQLite.
A comprehensive collection of data science cheatsheets for developers and data scientists.
Open-source graph database optimized for dynamic analytics and streaming data environments.
Apache Doris is a high-performance, unified analytics database for real-time data processing.
A toolkit for SQLite databases, focused on application development with a Swift-based API.
Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.
SciPy is a Python library for scientific and technical computing, providing a wide range of algorithms and tools.
An open-source metadata platform for managing your data and AI stack across the enterprise.
A Postgres extension for high-performance vector search, complementing pgvector for scale.
A Python library for financial analysis and data scraping from the Finviz platform.
A curated list of awesome PostgreSQL software, libraries, tools and resources.
Synthea is an open-source synthetic patient population simulator for generating realistic healthcare data.
An extensible, high-performance columnar file format for data storage and processing.
An open-source, self-hosted database management tool with a spreadsheet-like interface for Postgres
A data platform that enables building data pipelines with SQL, Python, and ingesting from various sources.
Apache Arrow is a fast columnar data format and toolset for in-memory analytics and data interchange.
Citus is a distributed PostgreSQL database that enables scaling out your Postgres database across multiple nodes.
Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.