Explore Projects

Discover 2 open source projects

Active filters (1):
Search: fp4ร—
Clear all

Showing 1-2 of 2 projects

NVIDIA/TransformerEngine

A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.

3.2K
Active
Python
LLM Frameworks
Inference
PyTorch
#deep-learning#gpu#cuda

intel/neural-compressor

Optimizes large language models for low-bit precision and sparsity, improving model compression techniques.

2.6K
Active
Python
LLM Frameworks
PyTorch
#quantization#post-training-quantization#sparsity

Stay in the loop

Get weekly updates on trending AI coding tools and projects.