Explore Projects

Discover 46 open source projects

Active filters (1):
Search: quantizationร—
Clear all

Showing 1-20 of 46 projects

hiyouga/LlamaFactory

Fine-tuning framework for 100+ LLMs & VLMs

67.9K
Active
Python
Fine-tuning
#llm#fine-tuning#ai

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text#inference#quantization

ymcui/Chinese-LLaMA-Alpaca

Open-source Chinese LLaMA and Alpaca large language models for local CPU/GPU training and deployment.

19.0K
Experimental
Python
LLM Frameworks
Python
#alpaca#llama#large-language-models

UFund-Me/Qbot

An AI-powered quantitative investment research platform for algorithmic trading and backtesting.

16.4K
Experimental
Jupyter Notebook
Agents & Orchestration
Jupyter Notebook
#fintech#quantitative-trading#backtesting

artidoro/qlora

Efficient finetuning of quantized LLMs for AI developers

10.8K
Archived
Jupyter Notebook
React
#quantized LLMs#finetuning#AI development

bitsandbytes-foundation/bitsandbytes

Accessible large language models via k-bit quantization for PyTorch developers working with AI tools.

8.0K
Active
Python
LLM Frameworks
API Frameworks
PyTorch
#llm#quantization#machine-learning

kornelski/pngquant

A high-performance PNG compressor library and CLI tool for reducing the file size of PNG images.

5.6K
Experimental
C
Image Optimization
CLI Tools
#png#compression#optimization

OpenNMT/CTranslate2

Fast C++ inference engine for Transformer models, supporting CUDA, MKL, and other optimizations.

4.3K
Active
C++
Inference
API Frameworks
#deep-learning#machine-translation#neural-machine-translation

nunchaku-ai/nunchaku

An open-source library for quantizing diffusion models to 4-bit precision, absorbing outliers through low-rank components.

3.7K
Active
Python
Diffusion Models
Quantization
PyTorch
#diffusion-models#quantization#mlops

shidenggui/easyquant

EasyQuant is a Python-based stock quantization framework for real-time market data and trading.

3.5K
Experimental
Python
React
#quantization#stock-market#real-time

mit-han-lab/llm-awq

A library for efficient weight quantization of large language models to accelerate inference on edge devices.

3.5K
Experimental
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#llm#compression#acceleration

city96/ComfyUI-GGUF

Provides GGUF quantization support for native ComfyUI models

3.3K
Active
Python
MCP Servers
React
#quantization#ComfyUI#GGUF

thu-ml/SageAttention

Quantized attention that achieves 2-5x speedup over FlashAttention for language, image, and video models.

3.2K
Active
Cuda
Inference
Quantization
PyTorch
#attention#efficient-attention#inference-acceleration

neuralmagic/deepsparse

A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.

3.2K
Experimental
Python
Inference
API Frameworks
PyTorch
#computer-vision#nlp#object-detection

huawei-noah/Pretrained-Language-Model

Pretrained language model and optimization techniques for large-scale distributed AI/ML development.

3.2K
Archived
Python
LLM Frameworks
Model Compression
Python
#pretrained-models#knowledge-distillation#large-scale-distributed

IntelLabs/nlp-architect

A model library for exploring state-of-the-art deep learning techniques for optimizing NLP neural networks.

2.9K
Archived
Python
LLM Frameworks
API Frameworks
PyTorch
#nlp#deep-learning#transformers

turboderp/exllama

A memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.

2.9K
Archived
Python
LLM Wrappers & SDKs
CLI Tools
Python
#llama#transformers#quantized-weights

nunchaku-ai/ComfyUI-nunchaku

ComfyUI Plugin of Nunchaku, a developer tool for AI-powered coding and rapid prototyping.

2.8K
Active
Python
AI Code Generation
LLM Frameworks
Python
#comfyui#diffusion#flux

aaron-xichen/pytorch-playground

A PyTorch repository providing pre-trained models and datasets for common computer vision tasks.

2.7K
Archived
Python
ML Ops
Databases
PyTorch
#pytorch#computer-vision#datasets

intel/neural-compressor

Optimizes large language models for low-bit precision and sparsity, improving model compression techniques.

2.6K
Active
Python
LLM Frameworks
PyTorch
#quantization#post-training-quantization#sparsity

Stay in the loop

Get weekly updates on trending AI coding tools and projects.