Explore Projects

Discover 106 open source projects

Active filters (1):
Search: large-scaleร—
Clear all

Showing 61-80 of 106 projects

facebookincubator/fastmod

A fast, regex-based code refactoring tool written in Rust to assist with large-scale codebase changes.

1.9K
Experimental
Rust
CLI Tools
API Frameworks
Rust
#cli#find-and-replace#refactoring

laekov/fastmoe

A fast implementation of Mixture of Experts (MoE) for PyTorch, enabling efficient large-scale neural networks.

1.8K
Experimental
Python
LLM Frameworks
API Frameworks
PyTorch
#mixture-of-experts#large-scale-neural-networks#pytorch-library

TurboWay/big_screen

A data visualization library for building large-scale data visualization screens.

1.8K
Active
HTML
Charts & Visualization
Full-Stack Frameworks
HTML
#data-visualization#charts#dashboards

alibaba/havenask

Havenask is a distributed information search system widely used within Alibaba Group.

1.8K
Stable
C++
API Frameworks
Search
#search#distributed-system#alibaba

gchq/Gaffer

A large-scale entity and relation database supporting aggregation of properties for big data applications.

1.8K
Experimental
Java
Databases
API Frameworks
#big-data#graph-database#hadoop

apple/ml-4m

4M: Massively Multimodal Masked Modeling, a Python library for large-scale multimodal language models

1.8K
Experimental
Python
LLM Frameworks
Databases
None
#multimodal#language-model#dataset

OryxProject/oryx

A distributed real-time machine learning platform built on Apache Spark and Kafka for large-scale workloads.

1.8K
Archived
Java
ML Ops
API Frameworks
Apache Spark
#real-time#machine-learning#big-data

scikit-learn-contrib/lightning

Large-scale linear classification, regression, and ranking library for Python developers.

1.8K
Archived
Python
ML Ops
API Frameworks
Python
#machine-learning#classification#regression

coin-or/Ipopt

An interior point optimizer library for solving large-scale nonlinear optimization problems.

1.7K
Stable
C++
Optimization
#optimization#nonlinear#scientific-computing

Tencent/paxosstore

PaxosStore is a high-performance, distributed database solution built for large-scale applications.

1.7K
Archived
C++
Databases
#distributed-database#consensus#paxos

stepjam/RLBench

RLBench is a large-scale benchmark and learning environment for reinforcement learning agents.

1.7K
Archived
Python
Agents & Orchestration
#reinforcement-learning#benchmark#learning-environment

YoongiKim/AutoCrawler

A powerful Google and Naver web crawler built with Python, Selenium, and multiprocessing for efficient large-scale data collection.

1.7K
Archived
Python
Backend & APIs
Data Pipelines
Selenium
#web-crawler#multiprocessing#data-extraction

aphrodite-engine/aphrodite-engine

A large-scale LLM inference engine built in C++ with support for various AI hardware accelerators.

1.7K
Active
C++
LLM Frameworks
Inference
#machine-learning#inference-engine#cuda

eBay/akutan

A distributed knowledge graph store built in Go for managing large-scale semantic data.

1.7K
Archived
Go
Databases
API Frameworks
#graph-database#knowledge-graph#rdf

intelligent-machine-learning/dlrover

DLRover is an automatic distributed deep learning system for training large-scale AI models on Kubernetes.

1.6K
Active
Python
ML Ops
Containerization
Python
#distributed-training#kubernetes#large-scale-ai

YelpArchive/undebt

A fast, reliable tool for performing large-scale automated code refactoring in Python projects.

1.6K
Archived
Python
CLI Tools
API Frameworks
Python
#code-refactoring#automation#python

SonyResearch/micro_diffusion

Official repository for work on micro-budget training of large-scale diffusion models for AI coding tools.

1.5K
Archived
Python
Inference
AI Code Generation
Python
#diffusion-models#machine-learning#code-generation

VAST-AI-Research/TripoSG

TripoSG is a high-fidelity 3D shape synthesis tool that uses large-scale rectified flow models for generating 3D shapes.

1.5K
Experimental
Python
Computer Vision
3D Generation
Python
#3d-generation#3d-reconstruction#computer-vision

facebookresearch/fastMRI

A large-scale dataset of raw MRI measurements and clinical MRI images for medical imaging research.

1.5K
Archived
Python
Computer Vision
Datasets
PyTorch
#medical-imaging#mri-reconstruction#deep-learning

paypal/squbs

An Akka-based toolkit for building large-scale, production-ready distributed applications in Scala.

1.4K
Archived
Scala
API Frameworks
CLI Tools
Akka
#akka#akka-streams#akka-http

Stay in the loop

Get weekly updates on trending AI coding tools and projects.