Explore Projects

Discover 16 open-source projects

Active filters (1):
Search: streaming-data

Showing 1-16 of 16 projects

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

14.3K
Stable
Databases
#big-data #data-analytics #data-science

provectus/kafka-ui

An open-source web UI for managing Apache Kafka clusters, supporting developers working with event streaming.

11.9K
Archived
Java
API Frameworks
#apache-kafka #event-streaming #cluster-management

johnkerl/miller

Miller is a powerful CLI tool for processing tabular data like CSV, TSV, and JSON, similar to awk, sed, and other Unix utilities.

9.8K
Active
Go
CLI Tools
#csv #json #data-processing

redpanda-data/connect

A highly configurable, production-ready stream processing platform for building real-time data pipelines.

8.6K
Active
Go
Realtime
ETL & Pipelines
#stream-processing #message-queue #data-engineering

online-ml/river

A Python library for online machine learning, enabling incremental and real-time learning on data streams.

5.7K
Active
Python
ML Ops
Streaming
#concept-drift #incremental-learning #online-learning

readysettech/readyset

A high-performance caching layer that speeds up queries and scales read throughput for MySQL and Postgres databases.

5.2K
Active
Rust
Caching
#caching #databases #mysql

fluvio-community/fluvio

Fluvio is an event stream processing engine for developers to build responsive data-intensive apps.

5.2K
Active
Rust
Data Pipelines
Realtime
#streaming #real-time #data-processing

memgraph/memgraph

Open-source graph database optimized for dynamic analytics and streaming data environments.

3.8K
Active
C++
Databases
Realtime
#graph-database #streaming #realtime

piskvorky/smart_open

Utilities for streaming large files (S3, HDFS, gzip, bz2) in Python.

3.4K
Active
Python
API Frameworks
Caching
#streaming #file-handling #s3

reugn/go-streams

A lightweight stream processing library for Go developers that supports various streaming platforms.

2.2K
Active
Go
API Frameworks
ETL & Pipelines
#stream-processing #data-pipeline #kafka

kafbat/kafka-ui

Open-source web UI for managing Apache Kafka clusters, a popular distributed streaming platform.

2.1K
Active
Java
API Frameworks
Databases
#apache-kafka #big-data #cluster-management

bytewax/bytewax

Bytewax is a Python library for building scalable, fault-tolerant, and low-latency data processing pipelines.

2.0K
Experimental
Python
ETL & Pipelines
API Frameworks
#streaming #data-engineering #data-processing

python-streamz/streamz

A real-time stream processing library for Python that enables efficient handling of streaming data.

1.3K
Active
Python
API Frameworks
Realtime
#async #real-time #streaming-data

microsoft/Trill

Trill is a single-node query processor for temporal or streaming data.

1.3K
Archived
C#
Databases
Caching
#streaming-data #temporal-data #database

DoneDeal0/superdiff

Superdiff is a high-performance, zero-dependency library for efficiently comparing and diffing arrays and objects.

1.1K
Active
TypeScript
API Frameworks
Validation
React
#array-comparison #object-comparison #object-diff

zpl-c/zpl

A powerful, cross-platform C library that provides a wide range of utilities for developers.

1.1K
Experimental
C
CLI Tools
#c #cross-platform #threading
