Showing 41-60 of 299 projects
Sarama is a Go library for Apache Kafka, a distributed streaming platform for building real-time data pipelines.
A simple, fast, and repeatable build framework for creating Docker images and CI/CD pipelines.
Kubescape is an open-source Kubernetes security platform that provides risk analysis, security, compliance, and misconfiguration scanning.
A Python library that helps ensure data quality and reliability through data profiling and testing.
Kedro is a Python toolkit for building production-ready data science and machine learning pipelines.
PRQL is a modern, powerful, and pipelined SQL replacement for transforming data.
StreamDiffusion is a Python library that provides a pipeline-level solution for real-time interactive generation.
A full-text and vector search engine for developers, under 2kb in size, with support for typo tolerance and hybrid search.
A Python Automated Machine Learning tool that optimizes ML pipelines using genetic programming.
A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.
A fast, multi-purpose HTTP toolkit for running multiple probes using the retryablehttp library.
This repository helps developers set up a full CI/CD pipeline with Jenkins, Docker, Kubernetes, and Argo CD for deployment.
A cloud-native Pipeline resource for building and deploying applications on Kubernetes.
An open-source, Rust-based event streaming platform for real-time data processing and analytics.
mage-ai is a Python-based platform for building, running, and managing data pipelines that integrate and transform data.
An open-source data lakehouse framework that enables building data pipelines with leading big data compute engines.
A highly configurable, production-ready stream processing platform for building real-time data pipelines.
A screaming-fast Python HTTP toolkit with a pipelining HTTP server based on uvloop and picohttpparser.
BentoML is an easy-to-use framework for building and deploying production-ready machine learning models as APIs.
Pentaho Data Integration (ETL) is a Java-based tool for building data integration and ETL pipelines.