High-throughput LLM inference engine for developers
High-performance serving framework for large language and multimodal models
TensorRT-LLM provides a Python API and GPU-specific optimizations for running large language models efficiently on NVIDIA GPUs.
A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.
Parallax is a distributed model serving framework that lets you build your own AI cluster anywhere.