A tool for structurally pruning large language models like LLaMA, BLOOM, and Vicuna to reduce their size and inference time.