Explore Projects

Discover 140 open source projects

Active filters (1):
Search: spark×
Clear all

Showing 41-60 of 140 projects

lw-lin/CoolplaySpark

Open-source Spark codebase analysis and library for Scala developers working with Apache Spark.

3.5K
Archived
Scala
API Frameworks
Databases
Scala
#apache-spark#spark-streaming#spark-core

liyupi/sql-generator

A tool to generate structured SQL statements using JSON, built with Vue3, TypeScript, and Ant Design.

3.5K
Archived
Vue
Component Libraries (Vue/Svelte)
ORMs & Query Builders
Vue
#sql#json#vue3

apache/linkis

Apache Linkis provides a computation middleware layer to connect, govern, and orchestrate applications with data engines.

3.4K
Active
Java
MCP Servers
BaaS Platforms
#application-manager#engine#jdbc

databricks/koalas

Koalas is a pandas-like API for Apache Spark, enabling data scientists to work with big data using familiar pandas syntax.

3.4K
Archived
Python
ORMs & Query Builders
Databases
Spark
#big-data#data-science#dataframe

WeBankFinTech/DataSphereStudio

DataSphereStudio is a one-stop data application development and management portal covering data exchange, analysis, and visualization.

3.3K
Stable
Java
ETL & Pipelines
API Frameworks
Spark
#data-management#data-analysis#data-visualization

lakesoul-io/LakeSoul

LakeSoul is a cloud-native, real-time Lakehouse framework for fast data ingestion and analytics on cloud storage.

3.2K
Active
Java
API Frameworks
Databases
#big-data#lakehouse#streaming

chrislusf/glow

Glow is a distributed computation system written in Go, similar to Hadoop MapReduce, Spark, and Flink.

3.2K
Archived
Go
API Frameworks
Databases
#distributed-computing#big-data#data-processing

apache/paimon

Apache Paimon is a lake format that enables building a Realtime Lakehouse Architecture with Flink and Spark.

3.2K
Active
Java
ETL & Pipelines
Realtime
#big-data#data-ingestion#flink

spark-notebook/spark-notebook

An interactive and reactive data science platform powered by Scala and Apache Spark.

3.2K
Archived
JavaScript
Databases
ETL & Pipelines
Scala
#data-science#interactive#reactive

MoRan1607/BigDataGuide

A comprehensive guide to big data, covering various tools and technologies for learning and development.

3.1K
Active
React
#bigdata#machine learning#development

kubeflow/spark-operator

A Kubernetes operator for managing the lifecycle of Apache Spark applications on Kubernetes.

3.1K
Active
Go
API Frameworks
Containerization
Kubernetes
#apache-spark#kubernetes#kubernetes-operator

databricks/Spark-The-Definitive-Guide

This repository contains the code for Spark: The Definitive Guide, a comprehensive guide to using Apache Spark.

3.1K
Archived
Scala
API Frameworks
Databases
#apache-spark#big-data#data-processing

cirosantilli/china-dictatorship

Political activism documentation on Chinese government censorship, human rights, and censorship circumvention techniques.

2.9K
Active
HTML
Resource Collections
Privacy Tools
#censorship-circumvention#china-dictatorship#human-rights

vector4wang/spring-boot-quick

A comprehensive collection of Spring Boot-based code examples, covering a wide range of popular frameworks and tools.

2.8K
Experimental
Java
API Frameworks
IDE Extensions
Spring Boot
#spring-boot#java#api

intel/BigDL

BigDL is a distributed deep learning library that allows developers to run TensorFlow, Keras and PyTorch models on Apache Spark/Flink and Ray.

2.7K
Stable
Jupyter Notebook
Distributed Deep Learning
API Frameworks
TensorFlow
#deep-learning#distributed-computing#spark

satazor/js-spark-md5

A fast and efficient library for computing MD5 hashes in JavaScript, supporting both normal and incremental modes.

2.6K
Archived
JavaScript
General Utilities
#md5#hashing#cryptography

deeplearning4j/deeplearning4j-examples

Deeplearning4j is a deep learning library for Java and Scala, with examples for building AI/ML applications.

2.5K
Stable
Java
LLM Frameworks
API Frameworks
Java
#artificial-intelligence#deeplearning#ml-ops

unsplash/react-trend

Simple, elegant spark lines library for React developers to visualize data trends.

2.5K
Archived
JavaScript
Charts & Visualization
Frontend Frameworks
React
#data-visualization#spark-lines#react-components

geekyouth/SZT-bigdata

This is a big data analysis system for the Shenzhen metro with support for various data processing tools.

2.4K
Archived
Scala
Databases
API Frameworks
Scala
#big-data#data-analysis#metro

aftertheflood/sparks

A typeface for creating sparklines in text without code.

2.4K
Archived
CSS
Animation & Motion
Charts & Visualization
CSS
#css#visualization#sparklines

Stay in the loop

Get weekly updates on trending AI coding tools and projects.