A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.
SparseML is a library for applying sparsification recipes to neural networks in a few lines of code, producing faster and smaller models.
CRATE is a Python library that enables efficient compression and sparsification of transformer-based models.