Showing 1-2 of 2 projects
A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.
Optimizes large language models for low-bit precision and sparsity, improving model compression techniques.
Get weekly updates on trending AI coding tools and projects.