A real-time streaming platform built on Apache Flink for building scalable and reliable data pipelines.
A real-time chat application built with Vue.
A comprehensive repository covering big data knowledge, including data warehouse modeling, real-time computing, Hadoop, Spark, and more.
Distributed high-performance data integration engine for batch, streaming, and incremental scenarios.
A GitHub repository of over 2,000 data engineering interview questions to help developers prepare for interviews.
A Java-based framework with a web UI for building agile DataOps pipelines using tools such as Flink, DataX, and Chunjun.
A big data development platform covering submission, scheduling, operations and maintenance, and metrics display.
A scalable, SQL-based streaming analytics platform from Uber, built on top of Apache Flink.
Apache Amoro is an open-source Lakehouse management system built on open table formats such as Iceberg and Hudi, working with compute engines such as Flink.
Open-source data warehouse learning project with examples and code for building real-time and offline data pipelines.
This repository contains training exercises for Apache Flink, a popular distributed stream processing framework.