FMInference/FlexLLMGen

A Python library for running large language models on a single GPU for high-throughput scenarios.

Python
AI & Machine Learning
LLM Frameworks
Apache-2.0

9.4K

Stars

590

Forks

Feb 15, 2023

Created

Oct 28, 2024

Last Updated

Project Analytics

Stars Growth (1 Month)

-2

-0.0% change

Avg Daily Growth (1 Month)

-0.1

stars per day

Fork/Star Ratio (All Time)

6.3%

Normal engagement

Lifetime Growth

8.4

stars/day over 1.1K days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

large-language-models
high-throughput
gpu-optimization
deep-learning
machine-learning

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.