Showing 1-3 of 3 projects
SparseML provides a library for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models.
A curated list of efficient and compressed large language models for developers to explore.
A tool for structurally pruning large language models like LLaMA, BLOOM, and Vicuna to reduce their size and inference time.
Get weekly updates on trending AI coding tools and projects.