Explore Projects

Discover 2 open source projects

Active filters (1):
Search: smoothquantร—
Clear all

Showing 1-2 of 2 projects

intel/neural-compressor

Optimizes large language models for low-bit precision and sparsity, improving model compression techniques.

2.6K
Active
Python
LLM Frameworks
PyTorch
#quantization#post-training-quantization#sparsity

mit-han-lab/smoothquant

SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.

1.6K
Archived
Python
LLM Frameworks
Inference
Python
#quantization#large-language-models#performance-optimization

Stay in the loop

Get weekly updates on trending AI coding tools and projects.