Explore Projects

Discover 7 open source projects

Active filters (1):
Search: mapreduceร—
Clear all

Showing 1-7 of 7 projects

donnemartin/data-science-ipython-notebooks

Data science Python notebooks covering deep learning, machine learning, big data, and more.

28.9K
Archived
Python
Computer Vision
ML Ops
TensorFlow
#data-science#deep-learning#machine-learning

heibaiying/BigData-Notes

A comprehensive guide to big data technologies like Hadoop, Spark, Kafka, and more for developers.

16.9K
Archived
Java
Databases
#big-data#hadoop#spark

PowerJob/PowerJob

Enterprise job scheduling middleware with distributed computing ability for Java developers.

7.7K
Stable
Java
Background Jobs
CLI Tools
#job-scheduling#distributed-computing#cron

chrislusf/glow

Glow is a distributed computation system written in Go, similar to Hadoop MapReduce, Spark, and Flink.

3.2K
Archived
Go
API Frameworks
Databases
#distributed-computing#big-data#data-processing

twitter/summingbird

A streaming MapReduce library for Scala that integrates with Scalding and Storm.

2.1K
Archived
Scala
API Frameworks
Databases
Scala
#streaming#mapreduce#scalding

collabH/bigdata-growth

A comprehensive repository covering big data knowledge, including data warehouse modeling, real-time computing, Hadoop, Spark, and more.

1.7K
Stable
Shell
Databases
ETL & Pipelines
#bigdata#hadoop#spark

mahmoudparsian/data-algorithms-book

This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.

1.1K
Archived
Java
Databases
ETL & Pipelines
Apache Hadoop
#data-algorithms#mapreduce#spark

Stay in the loop

Get weekly updates on trending AI coding tools and projects.