Explore Projects

Discover 7 open source projects

Active filters (1):
Search: trinoร—
Clear all

Showing 1-7 of 7 projects

trinodb/trino

Trino is a distributed SQL query engine for big data, allowing fast, scalable, and cost-effective analytics.

12.6K
Active
Java
Databases
#big-data#analytics#data-science

tobymao/sqlglot

A Python library for parsing and transpiling SQL queries across various databases and engines.

9.0K
Active
Python
API Frameworks
ORMs & Query Builders
Python
#sql-parser#sql-transpiler#database-abstraction

delta-io/delta

An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.

8.6K
Active
Scala
ETL & Pipelines
API Frameworks
Spark
#big-data#data-engineering#data-lakehouse

ibis-project/ibis

Portable Python dataframe library for data analysis and manipulation

6.4K
Active
Python
React
#dataframe#python#analysis

datafold/data-diff

A Python library for comparing data across databases, supporting various database engines.

3.0K
Archived
Python
Databases
ETL & Pipelines
#data-diffing#data-quality#data-engineering

wgzhao/Addax

A fast and versatile ETL tool that can transfer data between RDBMS and NoSQL databases seamlessly

1.4K
Active
Java
ETL & Pipelines
API Frameworks
#etl#database#rdbms

apache/amoro

Apache Amoro is an open-source Lakehouse management system built on big data formats like Flink, Hudi, and Iceberg.

1.1K
Active
Java
Databases
ETL & Pipelines
Flink
#big-data#data-lake#lakehouse

Stay in the loop

Get weekly updates on trending AI coding tools and projects.