Explore Projects

Discover 140 open source projects

Active filters (1):
Search: spark×
Clear all

Showing 61-80 of 140 projects

MIT-SPARK/TEASER-plusplus

A fast and robust C++ library for 3D point cloud registration, useful for robotics and SLAM applications.

2.2K
Stable
C++
Computer Vision
API Frameworks
#3d-reconstruction#3d-registration#optimization

zio/zio-quill

Compile-time Language Integrated Queries (LINQ) for Scala, supporting multiple databases and data sources.

2.2K
Active
Scala
API Frameworks
ORMs & Query Builders
Scala
#database#query-builder#scala

fugue-project/fugue

A unified interface for distributed computing on Spark, Dask and Ray without any rewrites.

2.1K
Active
Python
Databases
API Frameworks
Python
#distributed-computing#spark#dask

o-gs/dji-firmware-tools

Tools for handling firmware of DJI products, with a focus on quadcopters like the Inspire, Mavic, and Phantom.

2.0K
Stable
C
Firmware & Drivers
CLI Tools
#dji#firmware#reverse-engineering

moj-analytical-services/splink

Fast, accurate, and scalable probabilistic data linkage with support for multiple SQL backends.

2.0K
Active
Python
Databases
ETL & Pipelines
Python
#data-matching#data-deduplication#entity-resolution

databricks/spark-deep-learning

Deep learning library for Apache Spark that provides high-level APIs and models for building machine learning pipelines.

2.0K
Archived
Python
ML Ops
ETL & Pipelines
Apache Spark
#machine-learning#deep-learning#spark

endymecy/spark-ml-source-analysis

An open-source project that provides in-depth analysis of the source code and algorithms behind Spark's ML library.

2.0K
Archived
ML Ops
API Frameworks
#spark#machine-learning#source-code-analysis

apache/cassandra-spark-connector

A Scala connector that allows Apache Spark to interact with Apache Cassandra databases.

2.0K
Experimental
Scala
API Frameworks
Databases
Scala
#cassandra#spark#database

feathr-ai/feathr

Feathr is a scalable, unified data and AI engineering platform for enterprises, with features like feature engineering, feature governance, and a feature marketplace.

1.9K
Archived
Scala
Feature Flags
MLOps
Apache Spark
#data-engineering#feature-engineering#feature-governance

broadinstitute/gatk

Official code repository for the Genome Analysis Toolkit (GATK), a bioinformatics library for working with next-generation DNA sequencing data.

1.9K
Active
Java
ORMs & Query Builders
API Frameworks
Java
#bioinformatics#dna#genome

sparkjsdev/spark

An advanced 3D Gaussian Splatting renderer for THREE.js, useful for visualizing complex data.

1.9K
Active
TypeScript
Animation & Motion
Charts & Visualization
Three.js
#3d-rendering#data-visualization#animation

szilard/benchm-ml

A benchmarking tool for evaluating the performance of popular machine learning algorithms and libraries.

1.9K
Archived
R
ML Ops
Databases
R
#machine-learning#benchmark#performance-testing

awesome-spark/awesome-spark

A curated list of awesome Apache Spark packages and resources for developers.

1.9K
Archived
Shell

MIT-SPARK/Kimera-VIO

A C++ library for visual-inertial odometry and simultaneous localization and mapping (SLAM) with 3D mesh generation.

1.8K
Experimental
C++
Computer Vision
API Frameworks
#localization#mapping#robotics

clj-kondo/clj-kondo

A static code analyzer and linter that helps Clojure developers write clean and idiomatic code

1.8K
Active
Clojure
Linters & Formatters
#clojure#static-analysis#linter

gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

1.8K
Experimental
Java
Databases
API Frameworks
#big-data#graph-database#hadoop

gege-circle/.github

A Vue-based chat application with real-time functionality

1.8K
Stable
Component Libraries (Vue/Svelte)
Authentication
Vue
#real-time#vue#chat-application

OryxProject/oryx

A distributed real-time machine learning platform built on Apache Spark and Kafka for large-scale workloads.

1.8K
Archived
Java
ML Ops
API Frameworks
Apache Spark
#real-time#machine-learning#big-data

Estom/notes

A comprehensive collection of notes and tutorials covering a wide range of topics for developers, including AI, programming languages, and more.

1.8K
Stable
Jupyter Notebook
Tutorials & Courses
Backend Frameworks
Node.js
#note-taking#tutorial#programming-languages

apachecn/.github

This GitHub repository hosts the open-source organization ApacheCN, which focuses on AI, ML, and data science tools and resources.

1.8K
Stable
CSS
LLM Frameworks
Documentation
#ai#machine-learning#python

Stay in the loop

Get weekly updates on trending AI coding tools and projects.