Showing 21-40 of 299 projects
Workflow orchestration for resilient data pipelines in Python
High-performance observability data pipeline for logs and metrics
Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes
Distributed SQL database middleware for sharding, scalability, and security
Reactive Python notebook for data science and AI with git-friendly, deployable, and AI-native features
Taipy is a Python library that helps developers turn data and AI algorithms into production-ready web apps quickly.
Luigi is a Python module that helps developers build complex batch job pipelines with dependency management and workflow orchestration.
Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.
Dagger is an automation engine that helps developers build, test, and ship any codebase across CI/CD pipelines.
Kubeflow is a machine learning toolkit for building and deploying scalable ML pipelines on Kubernetes.
Turn websites into clean data pipelines & structured APIs in minutes with a low-code web scraping tool.
An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.
Unified framework for building enterprise RAG pipelines with small, specialized models
Logstash is a powerful open-source data processing pipeline that can ingest, transform, and output data from a variety of sources.
Apache DolphinScheduler is a modern data orchestration platform for creating high-performance workflows with low-code.
Unstructured is an open-source ETL solution for transforming complex documents into structured data for language models.
A Rust-based memory layer for AI agents, enabling serverless, single-file memory with instant retrieval and long-term storage.
Codis is a proxy-based Redis cluster solution that supports pipelining and dynamic scaling.
An open-source library that simplifies the process of loading 3D file formats into a unified data structure for game development and asset pipelines.
An open-source framework for change data capture from various databases using Apache Kafka.
Get weekly updates on trending AI coding tools and projects.