Explore Projects

Discover 10 open source projects

Active filters (1):
Search: delta-lakeร—
Clear all

Showing 1-10 of 10 projects

apache/doris

Apache Doris is a high-performance, unified analytics database for real-time data processing.

15.1K
Active
Java
Databases
Spark
#database#olap#real-time

trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

12.6K
Active
Java
Databases
#big-data#analytics#data-science

StarRocks/starrocks

A high-performance open source query engine for sub-second analytics on data lakehouse.

11.4K
Active
Java
Databases
#analytics#big-data#database

delta-io/delta

An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.

8.6K
Active
Scala
ETL & Pipelines
API Frameworks
Spark
#big-data#data-engineering#data-lakehouse

roapi/roapi

A Rust-based library to create full-fledged APIs for slowly moving datasets without writing code.

3.4K
Stable
Rust
API Frameworks
Databases
#analytics#column-store#data-lake

delta-io/delta-rs

A Rust library for interacting with Delta Lake, a data lake storage format, with Python bindings.

3.2K
Active
Rust
ETL & Pipelines
API Frameworks
#delta-lake#etl#data-engineering

Mooncake-Labs/pg_mooncake

A Rust-based library that provides real-time analytics on Postgres tables, supporting features like columnstore, delta-lake, and Iceberg.

1.9K
Stable
Rust
API Frameworks
Databases
#analytics#columnstore#delta-lake

tansu-io/tansu

Apache Kafka-compatible broker with support for S3, PostgreSQL, SQLite, Apache Iceberg, and Delta Lake.

1.6K
Active
Rust
API Frameworks
Databases
#apache-kafka#s3#postgresql

databricks/LearningSparkV2

This is a book that teaches how to use Apache Spark for lightning-fast data analytics.

1.4K
Archived
Scala
ETL & Pipelines
Databases
Spark
#apache-spark#delta-lake#mlflow

apache/incubator-xtable

Apache XTable is a cross-table converter for lakehouse table formats that facilitates interoperability across data processing systems and query engines.

1.2K
Active
Java
ETL & Pipelines
#interoperability#lakehouse#data-processing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.