Explore Projects

Discover 2 open source projects

Active filters (1):
Search: fastertransformerร—
Clear all

Showing 1-2 of 2 projects

InternLM/lmdeploy

LMDeploy is a toolkit for compressing, deploying, and serving large language models (LLMs).

7.7K
Active
Python
LLM Frameworks
Inference
Python
#llm#inference#deployment

NVIDIA/FasterTransformer

A high-performance C++ library for optimizing transformers like BERT and GPT for AI and machine learning applications.

6.4K
Archived
C++
LLM Frameworks
API Frameworks
PyTorch
#transformer#bert#gpt

Stay in the loop

Get weekly updates on trending AI coding tools and projects.