Apache Doris is a high-performance, unified analytics database for real-time data processing.
Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.
A high-performance, open-source query engine for sub-second analytics on the data lakehouse.
An open-source, Rust-based event streaming platform for real-time data processing and analytics.
Apache Iceberg is an open-source table format for large analytic datasets, providing a versioned and scalable data lake architecture.
High-performance data engine for AI and multimodal workloads, processing images, audio, video, and structured data at scale.
A dark, bluish color scheme for Vim and Neovim, popular among developers and suitable for 'vibe coders'.
Fast, single-binary C++ SQL ETL pipeline for stream processing, observability, analytics, and AI/ML.
A Rust-based library that provides real-time analytics on Postgres tables, with support for columnstore storage, Delta Lake, and Iceberg.
Apache Polaris is an open-source catalog for Apache Iceberg, a high-performance table format for data lakes.
Apache Kafka-compatible broker with support for S3, PostgreSQL, SQLite, Apache Iceberg, and Delta Lake.
Provides a JSON template to customize AWS Well-Architected reviews using Custom Lenses.
Postgres with Iceberg and data lake access for developers.
Nessie is a transactional data catalog for data lakes that provides Git-like semantics and functionality.
An open-source data pipeline tool, billed as the fastest of its kind, for replicating databases to data lakes in Apache Iceberg format.
A Rust implementation of the Apache Iceberg data lake table format.
Lakekeeper is an open-source, secure, and fast Apache Iceberg REST Catalog written in Rust for data lakehouse governance.
Apache XTable is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.
Apache Amoro is an open-source lakehouse management system that works with table formats such as Hudi and Iceberg and with processing engines such as Flink.
Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.