An open-source BI reporting and dashboard platform with data visualization and business intelligence features.
A highly scalable, high-performance graph database that supports over 100 billion data points.
FeatureBase is a fast analytical database built on bitmaps, well suited to ML and data-intensive applications.
Bare-bones examples of machine learning in TensorFlow for developers working with AI tools.
Open-source BI platform for engineers to explore and model large-scale data pipelines.
ArcticDB is a high-performance, serverless DataFrame database for the Python data science ecosystem.
Open-source web UI for managing Apache Kafka clusters, a popular distributed streaming platform.
A high-performance search engine written in Go, capable of handling 100 trillion lines of log data.
Apache BookKeeper is a scalable, fault-tolerant, low-latency storage service optimized for append-only workloads.
Apache DataFusion Ballista is a distributed query engine for big data analysis, built with Rust and Arrow.
Apache Kudu is a high-performance, open-source columnar storage engine for large datasets in the Apache Hadoop ecosystem.
Fluid is a distributed data abstraction and acceleration framework for big data and AI applications in the cloud.
Apache Fluss is a real-time streaming storage platform built for big data analytics.
A large-scale entity and relation database supporting aggregation of properties for big data applications.
Genie is a distributed big data orchestration service that helps manage and execute complex data pipelines.
The Auron accelerator framework leverages vectorized execution to speed up distributed computing on big data platforms like Spark.
Distributed high-performance data integration engine for batch, streaming, and incremental scenarios.
Apache Spark and Python tutorials for big data analysis and machine learning, provided as Jupyter notebooks.
A framework-agnostic dashboard library that allows creating dashboards using YAML or JSON files.
Tonbo is an embedded database for serverless and edge runtimes, optimized for offline-first and big data use cases.