Fine-tuning framework for 100+ LLMs & VLMs.
Faster Whisper transcription with CTranslate2 for efficient speech-to-text.
Open-source Chinese LLaMA and Alpaca large language models for local CPU/GPU training and deployment.
An AI-powered quantitative investment research platform for algorithmic trading and backtesting.
Efficient fine-tuning of quantized LLMs for AI developers.
Accessible large language models via k-bit quantization for PyTorch developers working with AI tools.
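As a concept sketch of the absmax (symmetric) k-bit weight quantization such libraries build on, here is a minimal pure-Python example at 8 bits; the function names and sample values are illustrative, not any library's API:

```python
def quantize_int8(weights):
    """Absmax (symmetric) quantization: scale floats into [-127, 127] integers."""
    scale = max(abs(w) for w in weights) / 127
    return [round(w / scale) for w in weights], scale

def dequantize(q, scale):
    """Reconstruct approximate floats from the integers and the shared scale."""
    return [v * scale for v in q]

w = [0.5, -1.2, 0.03, 2.4]
q, s = quantize_int8(w)
w_hat = dequantize(q, s)
# Rounding error per weight is bounded by half the quantization step (s / 2).
```

Real implementations quantize per-tensor or per-block, store the integers packed, and keep only the scales in floating point.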
A high-performance PNG compressor library and CLI tool for reducing the file size of PNG images.
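Since PNG image data is stored DEFLATE-compressed, one lever any PNG compressor has is the zlib compression level; a small sketch with Python's standard `zlib` module (the byte string is placeholder data, not real scanlines):

```python
import zlib

# Placeholder for filtered PNG scanline bytes.
raw = b"example scanline data " * 500

fast = zlib.compress(raw, level=1)   # fast, usually larger output
best = zlib.compress(raw, level=9)   # slower, usually smaller output

# DEFLATE is lossless: both streams decompress to the original bytes.
assert zlib.decompress(best) == raw
```

Dedicated PNG compressors go further, e.g. trying different scanline filters and reduced palettes before the DEFLATE step.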
Fast C++ inference engine for Transformer models, supporting CUDA, MKL, and other optimizations.
An open-source library for quantizing diffusion models to 4-bit precision, absorbing outliers through low-rank components.
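A toy NumPy sketch of the idea behind absorbing outliers into a low-rank component: peel off a full-precision rank-1 term before quantizing, so the residual has a much smaller dynamic range (the matrices and 4-bit absmax scheme here are illustrative assumptions, not the library's actual algorithm):

```python
import numpy as np

np.random.seed(0)
# Small residual weights plus a strong rank-1 "outlier" component.
R = np.random.uniform(-0.1, 0.1, (8, 8))
u, v = np.random.randn(8, 1), np.random.randn(1, 8)
W = 5.0 * u @ v + R

def quantize_absmax(M, bits=4):
    """Quantize to signed `bits`-bit integers and return the dequantized matrix."""
    qmax = 2 ** (bits - 1) - 1          # 7 for 4-bit signed
    scale = np.abs(M).max() / qmax
    return np.round(M / scale) * scale

# Direct 4-bit quantization: the rank-1 component stretches the range,
# so the quantization step is coarse.
err_direct = np.abs(W - quantize_absmax(W)).mean()

# Low-rank absorption: keep the best rank-1 approximation in full
# precision and quantize only the small residual.
U, S, Vt = np.linalg.svd(W)
L = S[0] * np.outer(U[:, 0], Vt[0])
err_lowrank = np.abs(W - (L + quantize_absmax(W - L))).mean()
```

With the outlier direction removed, the residual's absmax shrinks and the same 4-bit budget yields a much finer step size.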
EasyQuant is a Python-based quantitative trading framework for real-time market data and automated trading.
A library for efficient weight quantization of large language models to accelerate inference on edge devices.
Provides GGUF quantization support for native ComfyUI models.
Quantized attention that achieves 2-5x speedup over FlashAttention for language, image, and video models.
A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.
Pretrained language model and optimization techniques for large-scale distributed AI/ML development.
A model library for exploring state-of-the-art deep learning techniques that optimize NLP neural networks.
A memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
ComfyUI plugin for Nunchaku, a high-performance inference engine for 4-bit quantized diffusion models.
A PyTorch repository providing pre-trained models and datasets for common computer vision tasks.
Optimizes large language models for low-bit precision and sparsity, improving model compression techniques.
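Sparsity in this context usually starts from magnitude pruning: zeroing the smallest-magnitude weights. A minimal sketch under the assumption of distinct magnitudes (ties would prune extra weights); the function and values are illustrative:

```python
def magnitude_prune(weights, sparsity=0.5):
    """Zero out the smallest-magnitude `sparsity` fraction of weights."""
    k = int(len(weights) * sparsity)
    threshold = sorted(abs(w) for w in weights)[k - 1] if k else 0.0
    return [0.0 if abs(w) <= threshold else w for w in weights]

w = [0.1, -0.8, 0.05, 1.2, -0.3, 0.02]
pruned = magnitude_prune(w, sparsity=0.5)
# The three smallest-magnitude weights (0.1, 0.05, 0.02) are zeroed.
```

Practical schemes add structure, e.g. 2:4 semi-structured patterns, so that sparse kernels can actually skip the zeroed weights at inference time.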