microsoft/LLMLingua

This project aims to speed up large language model (LLM) inference and enhance their understanding of key information through prompt and KV-Cache compression.

Python
AI & Machine Learning
LLM Frameworks
MIT

5.9K

Stars

351

Forks

Jul 7, 2023

Created

Oct 28, 2025

Last Updated

Project Analytics

Stars Growth (1 Month)

+65

+1.1% change

Avg Daily Growth (1 Month)

+2.3

stars per day

Fork/Star Ratio (All Time)

6.0%

Normal engagement

Lifetime Growth

6.0

stars/day over 974 days

Stars Over Time

Forks Over Time

Open Issues Over Time

Pull Requests Over Time

Commits Over Time

AI-Generated Tags

llm
language-model
inference-optimization
prompt-engineering
compression

Comments (0)

Sign in to leave a comment or vote

Sign In

No comments yet. Be the first to comment!

Stay in the loop

Get weekly updates on trending AI coding tools and projects.