Explore Projects

Discover 106 open source projects

Active filters (1):
Search: large-scaleร—
Clear all

Showing 41-60 of 106 projects

waymo-research/waymo-open-dataset

Waymo Open Dataset is a large-scale dataset for autonomous driving research and development.

3.3K
Active
Python
Computer Vision
Datasets
Python
#autonomous-driving#dataset#computer-vision

puncsky/system-design-and-architecture

A comprehensive resource for learning system design and architecture, including interview prep.

3.2K
Archived
Makefile
Learning & Education
Backend & APIs
#system-design#architecture#interview-prep

alpa-projects/alpa

Alpa is a distributed training and serving framework for large-scale neural networks with auto-parallelization.

3.2K
Archived
Python
LLM Frameworks
API Frameworks
JAX
#distributed-computing#high-performance-computing#auto-parallelization

soundcloud/roshi

Roshi is a large-scale CRDT set implementation for timestamped events, written in Go.

3.2K
Stable
Go
API Frameworks
Databases
#crdt#distributed-systems#set-data-structure

huawei-noah/Pretrained-Language-Model

Pretrained language model and optimization techniques for large-scale distributed AI/ML development.

3.2K
Archived
Python
LLM Frameworks
Model Compression
Python
#pretrained-models#knowledge-distillation#large-scale-distributed

OpenDriveLab/AgiBot-World

An open-source large-scale manipulation platform for scalable and intelligent robotic systems.

2.8K
Stable
Python
Robotics
API Frameworks
Python
#pretraining-for-robotics#robotic-foundation-model#robotic-manipulation

thunlp/UltraChat

Large-scale, informative, and diverse multi-round chat data and models for chatbot and language model development.

2.8K
Archived
Python
LLM Frameworks
Tutorials & Courses
Python
#chatbot#chatgpt#large-language-models

OpenGVLab/InternImage

A PyTorch-based computer vision foundation model with deformable convolutions for object detection and segmentation.

2.8K
Experimental
Python
Computer Vision
API Frameworks
PyTorch
#backbone#deformable-convolution#foundation-model

mars-project/mars

A unified framework for large-scale data computation that scales popular Python data tools like NumPy, Pandas, and Scikit-Learn.

2.7K
Archived
Python
ML Ops
Caching
Dask
#machine-learning#data-processing#scale

bazelbuild/bazelisk

A user-friendly launcher for Bazel, a powerful build tool used in large-scale software development.

2.6K
Active
Go
CLI Tools
API Frameworks
#bazel#build-tool#cli-tool

detectRecog/CCPD

A diverse and well-annotated dataset for license plate detection and recognition

2.5K
Archived
Python
Computer Vision
Datasets
#ccpd#dataset#detection

camel-ai/oasis

Open-source agent-based simulation framework for large-scale AI societies and language model experiments

2.5K
Active
Python
Agents & Orchestration
LLM Frameworks
Python
#agent-based-simulation#large-language-models#multi-agent-systems

lg/murder

This Ruby project is a large-scale server deployment tool that uses BitTorrent and BitTornado, but is no longer maintained.

2.5K
Archived
Ruby
Containerization
Realtime
#deployment#torrent#server

github/glb-director

GitHub Load Balancer Director and supporting tooling for managing large-scale GitHub infrastructure.

2.4K
Active
C
API Frameworks
Containerization
Node
#load-balancing#github#infrastructure

microsoft/DialoGPT

A large-scale pretrained dialogue model for building conversational AI applications.

2.4K
Archived
Python
LLM Frameworks
API Frameworks
PyTorch
#dialogue#language-model#text-generation

google/youtube-8m

Starter code for working with the YouTube-8M dataset, a large-scale video understanding dataset.

2.4K
Archived
Python
Datasets
Python
#youtube#dataset#video-understanding

quarylabs/quary

Open-source BI platform for engineers to explore and model large-scale data pipelines.

2.4K
Active
Rust
ORMs & Query Builders
ETL & Pipelines
Rust
#analytics#big-data#data-modeling

Netflix/titus

Netflix's internal container management system for running large-scale workloads on AWS

2.0K
Archived
API Frameworks
Containerization
React
#container-management#aws#netflix

thu-coai/CDial-GPT

Large-scale Chinese Short-Text Conversation Dataset and pre-training dialog models for AI-powered developers

1.9K
Archived
Python
PyTorch
#dialogue#text-generation#gpt2

turms-im/turms

A high-performance open-source instant messaging engine for large-scale applications.

1.9K
Experimental
Java
Realtime
Authentication
Java
#chat#messaging#distributed

Stay in the loop

Get weekly updates on trending AI coding tools and projects.