Explore Projects

Discover 5 open source projects

Active filters (1):
Search: blackwellร—
Clear all

Showing 1-5 of 5 projects

vllm-project/vllm

High-throughput LLM inference engine for developers

72.1K
Active
Python
Inference
LLM Wrappers & SDKs
Hugging Face
#llm#inference#ai

sgl-project/sglang

High-performance serving framework for large language and multimodal models

24.1K
Active
Python
Inference
LLM Frameworks
Python
#llm#inference#serving

NVIDIA/TensorRT-LLM

TensorRT LLM provides a Python API and optimizations to efficiently run large language models on NVIDIA GPUs.

13.0K
Active
Python
LLM Frameworks
PyTorch
#cuda#llm-serving#moe

NVIDIA/TransformerEngine

A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.

3.2K
Active
Python
LLM Frameworks
Inference
PyTorch
#deep-learning#gpu#cuda

GradientHQ/parallax

Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere.

1.1K
Active
Python
LLM Frameworks
API Frameworks
PyTorch
#distributed-systems#decentralized-inference#llm-serving

Stay in the loop

Get weekly updates on trending AI coding tools and projects.