Showing 1-20 of 24 projects
Logstash is a powerful open-source data processing pipeline that can ingest, transform, and output data from a variety of sources.
A Python tool that generates a prompt-friendly extract of a GitHub codebase by replacing 'hub' with 'ingest' in any GitHub URL.
Open-source metering and usage-based billing API for consumption tracking, subscription management, pricing, and revenue analytics.
A high-performance, distributed data integration tool for batch, streaming, and CDC use cases.
An open-source, Rust-based event streaming platform for real-time data processing and analytics.
Optimized file parser for LLM ingestion with no loss, supporting PDFs, Docx, and PPTx.
A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.
Open-source data pipeline engine for real-time ETL, connecting data sources to warehouses like BigQuery, Snowflake, Redshift.
A lightweight ingestion library for fast, efficient and robust RAG pipelines
A comprehensive document search and storage platform for building AI applications using Python.
ingestr is a CLI tool that seamlessly copies data between any databases with a single command.
LakeSoul is a cloud-native, real-time Lakehouse framework for fast data ingestion and analytics on cloud storage.
Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.
An open-source dev data platform to ingest, analyze, and visualize data from DevOps tools for engineering insights.
A serverless, real-time data analysis framework for ingesting, analyzing, and alerting on data from any environment.
A tool that makes it easy to scrape and ingest content from various sources like GitHub, arXiv, and YouTube for use with large language models.
A PowerShell script and WPF UI tool to manage Intune and Azure policies and profiles.
A data platform that enables building data pipelines with SQL, Python, and ingesting from various sources.
An open-source, scalable online streaming toolkit for developers building video apps and services.
Fastest open-source data pipeline tool for replicating databases to data lakes in Apache Iceberg format.
Get weekly updates on trending AI coding tools and projects.