Showing 21-40 of 63 projects
A system for agentic LLM-powered data processing and ETL workflows for unstructured data analysis.
Flow-based programming framework for building complex JavaScript applications and services.
A curated list of resources for creating node-based UI editors and visual programming tools.
Python scripts for extracting, transforming and loading Ethereum blockchain data into Google BigQuery.
Fast, cost-effective data replication tool from Postgres to data warehouses, queues, and storage
An open-source dev data platform to ingest, analyze, and visualize data from DevOps tools for engineering insights.
Scalable and efficient data transformation framework with backwards compatibility for dbt.
Header-only C++ library providing STL-like containers & algorithms for embedded systems without dynamic memory.
Comprehensive analytics, versioning, and ETL toolkit for multimodal data (video, audio, PDFs, images)
Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.
Instill Core is an open-source AI infrastructure tool for orchestrating data, models, and pipelines to build AI-powered applications.
A real-time Postgres data replication and streaming library built in Rust for building CDC pipelines.
A lightweight stream processing library for Go developers that supports various streaming platforms.
Fast, single-binary C++ SQL ETL pipeline for stream processing, observability, analytics, and AI/ML.
superglue builds integrations and tools from natural language for long-tail and enterprise systems.
A community-driven wiki for learning data engineering, covering topics like data modeling, pipelines, and databases.
A collection of Udacity data engineering projects showcasing various tools and technologies.
A data processing and ETL (Extract, Transform, Load) framework for Ruby developers.
AIStore: A scalable, high-performance, and high-availability storage solution for AI applications and workloads.
Powerful, fast, and efficient unstructured data extraction library written in Rust with language bindings.
Get weekly updates on trending AI coding tools and projects.