Category
Showing 51-100 of 897 trending projects
Google's Operations Research tools for combinatorial optimization, linear programming, and operations research.
A Rust-based, Elasticsearch-quality search engine for PostgreSQL, enabling fast, real-time analytics and HTAP use cases.
OctoSQL is a powerful SQL query tool that allows you to join, analyze, and transform data from multiple databases and file formats.
Modern in-memory key-value store for caching and data management
Unified cloud-native data warehouse platform for analytics, search and AI, built on top of S3 storage.
Open-source relational database management system (RDBMS) for building data-driven applications.
Portfolio analytics library for quantitative finance, built with Python
efinance is a Python library for quickly accessing financial data (funds, stocks, bonds, futures) and backtesting/quantitative trading.
Comprehensive Chinese poetry database with JSON-formatted data for developers
Embeddable, persistent key-value store for fast storage with LSM design
Official Git mirror of the SQLite source tree, a popular and widely-used embedded database engine.
Open-source graph database optimized for dynamic analytics and streaming data environments.
A Python library for crawling historical data of China stocks.
An open-source framework for change data capture from various databases using Apache Kafka.
dbt enables data analysts and engineers to transform data using software engineering practices.
A curated list of awesome PostgreSQL software, libraries, tools and resources.
A high-performance open source query engine for sub-second analytics on data lakehouse.
An open-source multi-tool for exploring and publishing data, focused on simplifying data analysis and sharing.
A Chinese translation of a popular book on using Python for data analysis with libraries like pandas and numpy.
Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.
A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.
A tutorial and implementation of a disease-centered medical knowledge graph and QA system.
A Python library for quantitative trading and stock analysis.
Redisson is a Java client for Redis and Valkey with distributed objects and services
This is a Python project for big data analysis, focusing on HQL, SQL, and data processing.
Cloud-based database manager UI for querying, managing, and visualizing databases across multiple platforms.
A free, interactive SQL learning platform with an online SQL editor, real-time query results, and syntax highlighting.
A Python library that provides a set of customizable pipeline processing blocks for data processing tasks.
An extensible, high-performance columnar file format for data storage and processing.
AI-native database unifying vector, text, and structured data for hybrid search and in-database AI workflows.
A Python library for conveniently reading data from the Tongdaxin financial data platform.
A data repository for the data journalism site FiveThirtyEight, containing data and code behind their articles and graphics.
Distributed transactional key-value database, originally created to complement TiDB
libSQL is an open-source, open-contribution fork of SQLite, a widely used embedded database.
A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.
A curated list of awesome big data frameworks, resources and other awesomeness.
Python wrapper for the TA-Lib technical analysis library, useful for financial pattern recognition.
A tutorial for writing a SQLite clone from scratch in C, a useful resource for developers building database-backed applications.
A comprehensive database of countries, states, and cities with data in multiple formats
Apache DataFusion is a powerful SQL query engine written in Rust, designed for big data processing and analysis.
A toolkit for SQLite databases, focused on application development with a Swift-based API.
Fast, correct Python JSON library supporting dataclasses, datetimes, and numpy
Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.
A curated list of software packages and data resources for single-cell analysis, including RNA-seq and ATAC-seq.
Get weekly updates on trending AI coding tools and projects.