AIMET is an open-source library for advanced quantization and compression techniques in trained neural network models.
A comprehensive collection of resources for model quantization research and optimization.
Run Mixtral-8x7B language models on Colab or consumer desktops with offloading capabilities.
Implements the AWQ algorithm for 4-bit quantization, with a 2x speedup during inference.
A collection of Jupyter Notebooks on various AI and machine learning concepts, including fine-tuning, inference, and LLMs.
A Python library for optimizing deep learning models for faster inference on deployment platforms like TensorRT.
A curated list of efficient and compressed large language models for developers to explore.
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
A state-of-the-art audio codec with a 90x compression factor.
An open-source toolbox and benchmark for model compression and acceleration in PyTorch.
SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.
PaddleSlim is an open-source library for deep model compression and architecture search.
A minimalistic implementation of the Self Organizing Maps (SOM) algorithm for clustering and dimensionality reduction.
A toolkit to optimize machine learning models for deployment, including quantization and pruning.
An efficient C++ implementation of the RWKV language model for fast CPU inference, with support for quantization at various bit widths.
Brevitas is a PyTorch library for neural network quantization, enabling efficient hardware acceleration on FPGAs and other devices.
A JavaScript tool for calculating tokens per second and GPU memory requirements for large language models like LLaMA.
Official PyTorch repository for extreme compression of large language models using additive quantization and PV-Tuning.
Efficient computing methods developed by Huawei Noah's Ark Lab for model compression and optimization.
This Python project provides a cryptocurrency trading system for the Binance exchange, using grid trading strategies.