Optimizes large language models for low-bit precision and sparsity to improve model compression.
SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.