Showing 1-3 of 3 projects
A high-performance library for efficient neural network pruning and compression across LLMs, vision models, and more.
A comprehensive collection of resources for model quantization research and optimization.
A comprehensive survey of efficient large language models for AI and machine learning systems.
Get weekly updates on trending AI coding tools and projects.