Showing 101-120 of 140 projects
This is a book that teaches how to use Apache Spark for lightning-fast data analytics.
A collection of 50+ Docker images for DevOps tools, CI/CD, Hadoop, Kafka, Cassandra, and more.
Dr. Elephant is a performance monitoring and tuning tool for Apache Hadoop and Apache Spark.
A PyTorch implementation of a BERT-style pretraining method for convolutional networks, enabling more efficient self-supervised learning.
Provides Jupyter magics and kernels for working with remote Spark clusters, enabling data scientists to easily interact with Spark from Jupyter Notebooks.
A collection of PySpark examples covering RDD, DataFrame, and Dataset operations in Python.
An open-source big data management platform that helps developers build scalable cloud-native data applications.
A simple Android sparkline chart view library for displaying data trends.
A big data development platform for submission, scheduling, operation and maintenance, and indicator information display.
PySpark-Tutorial provides basic algorithms using PySpark for big data analytics and data processing.
Distributed deep learning on Hadoop and Spark clusters for vibe coders.
A high-performance profiler for Minecraft clients, servers, and proxies written in Java.
An AI-powered writing companion that helps spark creative inspiration for novel writing
A library for time series analysis on Apache Spark, enabling efficient large-scale time series processing.
A simple, elegant spark lines library for Vue.js developers
This repository provides the official Apache Spark documentation in Chinese, a popular big data processing framework.
A reference application showcasing the integration of streaming and batch data processing with Apache Spark Streaming, Cassandra, Kafka, and Akka.
LakeSail is a Rust-based computation framework that unifies batch processing, stream processing, and AI workloads.
A curated collection of websites to inspire creativity and design for developers.
Scalable identity resolution, entity resolution, data mastering and deduplication using ML
Get weekly updates on trending AI coding tools and projects.