Showing 1-10 of 10 projects
Apache Doris is a high-performance, unified analytics database for real-time data processing.
Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.
A high-performance, open-source query engine for sub-second analytics on data lakehouses.
An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.
A Rust-based library to create full-fledged APIs for slowly moving datasets without writing code.
A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.
A Rust-based library that provides real-time analytics on Postgres tables, supporting features like columnstore, Delta Lake, and Iceberg.
Apache Kafka-compatible broker with support for S3, PostgreSQL, SQLite, Apache Iceberg, and Delta Lake.
A book that teaches how to use Apache Spark for lightning-fast data analytics.
Apache XTable is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.