Data science Python notebooks covering deep learning, machine learning, big data, and more.
A comprehensive guide to big data technologies like Hadoop, Spark, Kafka, and more for developers.
Enterprise job-scheduling middleware with distributed computing capabilities for Java developers.
Glow is a distributed computation system written in Go, similar to Hadoop MapReduce, Spark, and Flink.
A streaming MapReduce library for Scala that integrates with Scalding and Storm.
A comprehensive repository covering big data knowledge, including data warehouse modeling, real-time computing, Hadoop, Spark, and more.
A comprehensive guide to data algorithms, with implementations using MapReduce, Spark, Java, and Scala.