Distributed transaction solution for microservices
Apache Flink is a stream processing framework for real-time and batch data processing.
Build interactive data apps and dashboards with Python, no JavaScript required.
Open-source BI tool for data visualization and analysis
Python library for downloading financial data from Yahoo! Finance
Workflow orchestration for resilient data pipelines in Python
Recommenders is a project for prototyping and operationalizing recommendation systems with Jupyter notebooks and best practices.
High-performance observability data pipeline for logs and metrics
Open-source IoT platform for device management, data collection, and visualization
AI-powered dataset management and preprocessing library for ML projects
Data integration platform for ELT pipelines from APIs, databases & files to databases, warehouses & lakes
Time series forecasting with Prophet, supporting multiple seasonalities and growth patterns.
Open-source semantic layer for AI, BI, and embedded analytics
Argo Workflows is a powerful open-source workflow engine for Kubernetes, enabling complex data processing and machine learning pipelines.
DVC is a data versioning and ML experiment tool that helps developers manage and track changes to data and models.
An open-source data orchestration platform for developing, running, and observing data pipelines and workflows.
A comprehensive cookbook for data engineers, covering best practices, big data, and data engineering concepts.
An open-source framework for change data capture from various databases using Apache Kafka.
dbt enables data analysts and engineers to transform data using software engineering practices.
A Python library that helps ensure data quality and reliability through data profiling and testing.