Showing 1-20 of 65 projects
A modern, enterprise-ready business intelligence web application for data visualization and exploration.
Learn to build production-grade ML applications with code and best practices
Apache Airflow for workflow orchestration
Free 9-week data engineering course with hands-on modules on pipelines, dbt, Kafka, and Spark
Curated resources for data science and machine learning in production
Workflow orchestration for resilient data pipelines in Python
Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes
Taipy is a Python library that helps developers turn data and AI algorithms into production-ready web apps quickly.
Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.
An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.
A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.
This is a roadmap for becoming a data engineer, not a developer discovery platform for vibe coders.
A Python library that helps ensure data quality and reliability through data profiling and testing.
A powerful, Python-powered shell with cross-platform support and a rich feature set for developers.
An open-source, Rust-based event streaming platform for real-time data processing and analytics.
mage-ai is a Python-based platform for building, running, and managing data pipelines and integrating/transforming data.
A highly configurable, production-ready stream processing platform for building real-time data pipelines.
Open-source feature flagging and A/B testing platform for experimentation, data analysis, and remote config.
An open-source feature store for AI/ML applications
Data pipelines for cloud config and security data, enabling CSPM, FinOps, and vulnerability management solutions.
Get weekly updates on trending AI coding tools and projects.