Showing 1-16 of 16 projects
Apache Airflow for workflow orchestration
Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes
Taipy is a Python library that helps developers turn data and AI algorithms into production-ready web apps quickly.
An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.
A high-performance, distributed data integration tool for batch, streaming, and CDC use cases.
mage-ai is a Python-based platform for building, running, and managing data pipelines and integrating/transforming data.
Flink CDC is a streaming data integration tool that enables real-time data pipelines and change data capture.
Data pipelines for cloud config and security data, enabling CSPM, FinOps, and vulnerability management solutions.
Fluvio is an event stream processing engine for developers to build responsive data-intensive apps.
Open-source data pipeline engine for real-time ETL, connecting data sources to warehouses like BigQuery, Snowflake, Redshift.
Rudder Server is a privacy-focused, Segment-alternative customer data platform written in Go and React.
A curated list of software packages and data resources for single-cell analysis, including RNA-seq and ATAC-seq.
ingestr is a CLI tool that seamlessly copies data between any databases with a single command.
An open-source dev data platform to ingest, analyze, and visualize data from DevOps tools for engineering insights.
Distributed high-performance data integration engine for batch, streaming, and incremental scenarios.
Hop is a flexible and extensible open-source data integration platform for building and orchestrating ETL and streaming pipelines.
Get weekly updates on trending AI coding tools and projects.