Fast, single-binary C++ SQL ETL pipeline for stream processing, observability, analytics, and AI/ML.
A spreadsheet-based pipeline for running a nanoGPT model, aimed at developers working with AI tools.
Deep learning library for Apache Spark that provides high-level APIs and models for building machine learning pipelines.
Elyra extends JupyterLab with an AI-centric approach for developing and deploying ML/AI pipelines.
Bytewax is a Python library for building scalable, fault-tolerant, and low-latency data processing pipelines.
A spaCy pipeline and models for processing scientific/biomedical documents.
A community-driven wiki for learning data engineering, covering topics like data modeling, pipelines, and databases.
A flexible machine learning framework for the Julia programming language, used for classification, clustering, and more.
The Azure Pipelines Agent is a tool for running build and deployment tasks in a CI/CD pipeline.
MongoDB data stream pipeline tools for managing real-time data synchronization and replication.
A real-time streaming platform built on Apache Flink for building scalable and reliable data pipelines.
A pipeline parallel training script for diffusion models, useful for AI and machine learning researchers.
The font-awesome font bundled as an asset for the Rails asset pipeline.
Robust and performant image loading and caching framework for iOS clients.
Voluptuous is a Python data validation library for building flexible validation pipelines.
Byzer is a low-code open-source programming language for data pipeline, analytics and AI.
Fast local PDF-to-Markdown/JSON converter for RAG pipelines. No GPU needed.
A collection of Udacity data engineering projects showcasing various tools and technologies.
A flexible and modular Go-based web crawler framework with a concurrent architecture.
This repository provides comprehensive tutorials and resources for learning data science and machine learning using Python.