Showing 41-56 of 56 projects
Data processing on Hadoop without the hassle, written in Clojure.
Dr. Elephant is a performance monitoring and tuning tool for Apache Hadoop and Apache Spark.
A big data development platform for submission, scheduling, operation and maintenance, and indicator information display.
Apache Impala is a high-performance, open-source, SQL query engine that runs on Apache Hadoop and Apache Kudu.
Distributed deep learning on Hadoop and Spark clusters for vibe coders.
Hadoop docker image for running Hadoop clusters in a containerized environment.
Scalable, reliable, distributed storage system optimized for data analytics and object store workloads.
A comprehensive collection of Nagios plugins for monitoring AWS, Hadoop, Cloud, Kafka, and other popular technologies.
Twitter's collection of LZO and Protocol Buffer-related Hadoop, Pig, Hive, and HBase code.
A curated list of resources for the Hadoop ecosystem, not a developer discovery platform focused on vibe coders.
Kylo is an enterprise-grade data lake management platform built on big data technologies like Spark and Hadoop.
A big data platform for analyzing e-commerce user behavior using Hadoop, Spark, and Java.
This repository provides a comprehensive guide and implementations for data algorithms using MapReduce, Spark, Java, and Scala.
A collection of study notes, ebooks, and resources on big data, machine learning, Linux, and more for developers.
Apache Ranger is a data security framework for the Hadoop platform, providing comprehensive access control and auditing capabilities.
Python module that simplifies writing and running Hadoop programs.
Get weekly updates on trending AI coding tools and projects.