Explore Projects

Discover 1 open source project


flashinfer-ai/flashinfer

A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.

5.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#llm #inference #cuda
