xlite-dev/Awesome-LLM-Inference

A curated list of awesome papers and code for optimizing LLM/VLM inference performance

Python
AI & Machine Learning
LLM Frameworks
GPL-3.0

5.0K

Stars

347

Forks

Aug 27, 2023

Created

Feb 27, 2026

Last Updated

Project Analytics

Stars Growth (1 Month)

+74

+1.5% change

Avg Daily Growth (1 Month)

+2.6

stars per day

Fork/Star Ratio (All Time)

6.9%

Normal engagement

Lifetime Growth

5.5

stars/day over 923 days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

llm
inference
optimization
performance
attention-mechanisms
quantization
parallelism

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.