Explore Projects

Discover 34 open source projects

Active filters (1):
Search: bigdata×
Clear all

Showing 1-20 of 34 projects

DataExpert-io/data-engineer-handbook

Comprehensive data engineering resource hub with learning paths, books, communities, and tools

40.4K
Stable
Jupyter Notebook
Tutorials & Courses
Awesome Lists
Apache Airflow
#dataengineering#bigdata#apachespark

taosdata/TDengine

High-performance time-series database for IoT and IIoT

24.8K
Active
C
Databases
#time-series#iot#industrial-iot

rustfs/rustfs

High-performance, distributed object storage system compatible with S3, built in Rust for speed and safety

22.8K
Active
Rust
Containerization
Search
#object-storage#s3-compatible#rust

apache/shardingsphere

Distributed SQL database middleware for sharding, scalability, and security

20.7K
Active
Java
Databases
Java
#distributed-sql#database-sharding#data-encryption

heibaiying/BigData-Notes

A comprehensive guide to big data technologies like Hadoop, Spark, Kafka, and more for developers.

16.9K
Archived
Java
Databases
#big-data#hadoop#spark

oxnr/awesome-bigdata

A curated list of awesome big data frameworks, resources and other awesomeness.

14.3K
Stable
Databases
#big-data#data-analytics#data-science

juicedata/juicefs

JuiceFS is a distributed POSIX file system built on top of Redis and S3 for big data and cloud-native applications.

13.3K
Active
Go
Databases
Go
#object-storage#s3#redis

wangzhiwubigdata/God-Of-BigData

A comprehensive collection of resources and learning materials for big data technologies like Flink, Spark, Hadoop, and Hive.

10.4K
Archived
Databases
#big-data#hadoop#spark

databendlabs/databend

Unified cloud-native data warehouse platform for analytics, search and AI, built on top of S3 storage.

9.2K
Active
Rust
Databases
Search
Rust
#cloud-native#data-warehouse#analytics

vaexio/vaex

A high-performance Python library for working with large tabular datasets, offering efficient data manipulation and visualization.

8.5K
Stable
Python
Databases
Caching
Python
#bigdata#data-science#dataframe

volcano-sh/volcano

A Cloud Native Batch System for running AI/ML workloads on Kubernetes at scale.

5.4K
Active
Go
ML Ops
API Frameworks
Kubernetes
#ai#batch-processing#kubernetes

TurboWay/bigdata_analyse

This is a Python project for big data analysis, focusing on HQL, SQL, and data processing.

5.0K
Archived
Python
Databases
ETL & Pipelines
#big-data#data-processing#data-analysis

iGaoWei/BigDataView

BigDataView provides a collection of HTML5 big data visualization templates for various industries.

5.0K
Stable
JavaScript
Charts & Visualization
React
#bigdata#echarts#html-template

liyupi/sql-generator

A tool to generate structured SQL statements using JSON, built with Vue3, TypeScript, and Ant Design.

3.5K
Archived
Vue
Component Libraries (Vue/Svelte)
ORMs & Query Builders
Vue
#sql#json#vue3

apache/avro

Apache Avro is a data serialization system for efficient storage and transmission of structured data.

3.2K
Active
Java
Databases
API Clients & Testing
#data-serialization#serialization-framework#big-data

MoRan1607/BigDataGuide

A comprehensive guide to big data, covering various tools and technologies for learning and development.

3.1K
Active
React
#bigdata#machine learning#development

griddb/griddb

GridDB is a fast and scalable open-source database for time-series IoT and big data applications.

2.5K
Stable
C++
Databases
API Frameworks
#bigdata#iot#timeseries

geekyouth/SZT-bigdata

This is a big data analysis system for the Shenzhen metro with support for various data processing tools.

2.4K
Archived
Scala
Databases
API Frameworks
Scala
#big-data#data-analysis#metro

apconw/Aix-DB

A LangChain-based framework for end-to-end natural language to data insight conversion, with MCP Skills multi-agent architecture.

2.0K
Active
JavaScript
LLM Frameworks
MCP Frameworks
LangChain
#llm#langchain#mcp

shzlw/poli

An open-source BI server focused on SQL-based data analysis and business intelligence reporting.

2.0K
Archived
Java
Charts & Visualization
API Frameworks
React
#business-intelligence#data-visualization#sql-editor
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.