Showing 1-7 of 7 projects
An open-source, Rust-based event streaming platform for real-time data processing and analytics.
Easy-to-use streaming application development framework and operation platform for building ETL pipelines.
Hamilton is an open-source ETL framework that helps data scientists and engineers build modular, testable dataflows with lineage and metadata.
A collection of Udacity data engineering projects showcasing various tools and technologies.
Powerful, fast, and efficient unstructured data extraction library written in Rust with language bindings.
An end-to-end data pipeline for building a data lake, data warehouse, and analytics platform from GoodReads data.
An enterprise-grade, API-first LLM workspace for unstructured document processing, with features like data extraction, redaction, and prompt engineering.
Get weekly updates on trending AI coding tools and projects.