Explore Projects

Discover 17 open source projects

Search: airflow

Showing 1-17 of 17 projects

apache/airflow

Apache Airflow for workflow orchestration

44.5K
Active
Python
ETL & Pipelines
Background Jobs
Python
#airflow #data-pipelines #workflow-orchestration

argoproj/argo-workflows

Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.

16.5K
Active
Go
ETL & Pipelines
Kubernetes
#kubernetes #pipelines #workflow

windmill-labs/windmill

An open-source developer platform to power entire infra and turn scripts into webhooks, workflows, and UIs.

15.9K
Active
HTML
Next.js
#authentication #streaming #real-time

apache/dolphinscheduler

Apache DolphinScheduler is a modern data orchestration platform for creating high-performance workflows with low-code.

14.2K
Active
Java
Realtime
#workflow-orchestration #job-scheduler #data-pipelines

jghoman/awesome-apache-airflow

Curated list of resources about Apache Airflow, a popular workflow management platform.

3.9K
Active
Shell
CLI Tools
Background Jobs
#airflow #workflow-management #etl

puckel/docker-airflow

A Docker-based Apache Airflow platform for building and managing data pipelines and workflows.

3.8K
Archived
Shell
Background Jobs
ETL & Pipelines
Docker
#airflow #workflow #scheduler

WeBankFinTech/DataSphereStudio

DataSphereStudio is a one-stop data application development and management portal covering data exchange, analysis, and visualization.

3.3K
Stable
Java
ETL & Pipelines
API Frameworks
Spark
#data-management #data-analysis #data-visualization

elyra-ai/elyra

Elyra extends JupyterLab with an AI-centric approach for developing and deploying ML/AI pipelines.

2.0K
Active
Python
ML Ops
MCP Frameworks
JupyterLab
#ai #machine-learning #jupyterlab

san089/Udacity-Data-Engineering-Projects

A collection of Udacity data engineering projects showcasing various tools and technologies.

1.8K
Archived
Python
Airflow
#data-engineering #cloud #infrastructure

teamclairvoyant/airflow-maintenance-dags

A set of Airflow DAGs to help maintain and manage the operation of an Airflow deployment.

1.8K
Archived
Python
API Frameworks
ETL & Pipelines
Apache Airflow
#airflow #maintenance #cleanup

OBenner/data-engineering-interview-questions

This GitHub repository contains over 2,000 data engineering interview questions to help developers prepare.

1.5K
Active
Python
Interview Prep
ETL & Pipelines
#data-engineering #interview-questions #interview-prep

san089/goodreads_etl_pipeline

An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.

1.5K
Archived
Python
ETL & Pipelines
Background Jobs
Apache Airflow
#data-engineering #etl-pipeline #data-lake

astronomer/dag-factory

Declaratively construct Apache Airflow DAGs with YAML configuration files, simplifying complex data pipeline management.

1.4K
Active
Python
API Frameworks
ETL & Pipelines
Python
#airflow #data-pipelines #etl

damklis/DataEngineeringProject

An end-to-end data engineering project example showcasing tools and technologies for building data pipelines.

1.4K
Archived
Python
ETL & Pipelines
API Frameworks
Django
#data-engineering #data-pipeline #etl

gtoonstra/etl-with-airflow

This repository provides best practices and examples for building ETL (Extract, Transform, Load) pipelines using Apache Airflow.

1.4K
Archived
Shell
ETL & Pipelines
#etl #airflow #data-pipelines

astronomer/astronomer-cosmos

Run your dbt Core or dbt Fusion projects as Apache Airflow DAGs and Task Groups with a few lines of code.

1.2K
Active
Python
API Frameworks
ETL & Pipelines
Python
#airflow #dbt #workflow

abhishek-ch/around-dataengineering

A comprehensive knowledge hub for data engineering, machine learning, and MLOps tools and practices.

1.1K
Archived
Python
ETL & Pipelines
ML Ops
Python
#data-engineering #machine-learning #mlops
