Inference

Explore 351 open source projects in Inference

Showing 301-320 of 351 projects

princeton-nlp/MeZO

MeZO: A fine-tuning method for language models that requires only forward passes.

1.2K
Archived
Python
LLM Frameworks
Fine-tuning
Python
#language-models #fine-tuning #efficient-training

showlab/Show-1

A text-to-video generation model that combines pixel-level and latent diffusion approaches.

1.1K
Stable
Python
Computer Vision
Inference
Python
#text-to-video #diffusion-models #computer-vision

frotms/PaddleOCR2Pytorch

A PyTorch-based implementation of the PaddleOCR optical character recognition model, enabling developers to leverage state-of-the-art AI for text detection and recognition.

1.1K
Stable
Python
Computer Vision
Inference
PyTorch
#ocr #text-detection #text-recognition

pix2pixzero/pix2pix-zero

A zero-shot image-to-image translation library for developers building AI-powered creative applications.

1.1K
Archived
Python
Computer Vision
Inference
Python
#computer-vision #image-translation #zero-shot

eBay/bayesian-belief-networks

A Python library for creating and performing exact inference on Bayesian Belief Networks.

1.1K
Archived
Python
Inference
API Frameworks
#bayesian-networks #probabilistic-modeling #machine-learning

rohitgandikota/sliders

A Jupyter Notebook project for precise control and experimentation with diffusion models.

1.1K
Experimental
Jupyter Notebook
LLM Frameworks
Inference
#diffusion-models #machine-learning #jupyter-notebook

huggingface/search-and-learn

Recipes to scale inference-time compute of open models for AI/ML developers.

1.1K
Experimental
Python
Inference
API Frameworks
Python
#ai-models #model-optimization #inference-speed

openvinotoolkit/nncf

Neural Network Compression Framework for enhanced OpenVINO™ inference

1.1K
Active
Python
Inference
Computer Vision
PyTorch
#openvino #quantization #pruning

basetenlabs/truss

The simplest way to serve AI/ML models in production, with support for popular models like Stable Diffusion and Whisper.

1.1K
Active
Python
Inference
API Development
Flask
#artificial-intelligence #machine-learning #model-serving

PrunaAI/pruna

Pruna is a model optimization framework that helps developers deliver faster, more efficient AI models with minimal overhead.

1.1K
Active
Python
ML Ops
Inference
Python
#model-optimization #machine-learning #ai-tooling

chongzhou96/EdgeSAM

Official PyTorch implementation of a prompt-based distillation method for on-device deployment of the Segment Anything Model (SAM).

1.1K
Experimental
Jupyter Notebook
Computer Vision
Inference
PyTorch
#computer-vision #on-device-ai #segmentation

arpitbansal297/Cold-Diffusion-Models

Official PyTorch implementation of Cold Diffusion, a diffusion technique that inverts arbitrary image transformations (such as blurring or masking) without relying on Gaussian noise.

1.1K
Archived
Python
Inference
Computer Vision
PyTorch
#diffusion-models #computer-vision #image-transformation

horseee/LLM-Pruner

A tool for structurally pruning large language models like LLaMA, BLOOM, and Vicuna to reduce their size and inference time.

1.1K
Archived
Python
LLM Frameworks
Inference
Python
#compression #pruning #large-language-models

THU-LYJ-Lab/T3Bench

A Python benchmark suite for evaluating text-to-3D generation models and techniques.

1.1K
Archived
Python
Computer Vision
Inference
Python
#3d-generation #text-to-3d #diffusion

Woolverine94/biniou

A self-hosted web UI for over 30 generative AI tools, including Stable Diffusion, GFPGAN, and Whisper.

1.1K
Active
Python
AI App Builders
Inference
React
#generative-ai #stable-diffusion #diffusers

yangheng95/PyABSA

A comprehensive library for aspect-based sentiment analysis, text classification, and text adversarial defense.

1.1K
Active
Jupyter Notebook
Agents & Orchestration
Inference
PyTorch
#sentiment-analysis #text-classification #text-augmentation

fpgaminer/joycaption

JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training Diffusion models.

1.1K
Stable
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#captioning #joycaption #vlm

IDEA-Research/Grounding-DINO-1.5-API

A powerful open-world object detection model for computer vision tasks, leveraging the DINO framework.

1.1K
Archived
Python
Computer Vision
Inference
PyTorch
#object-detection #open-world #zero-shot

tspeterkim/flash-attention-minimal

A minimal implementation of the Flash Attention algorithm in CUDA for efficient AI model inference.

1.1K
Archived
Cuda
LLM Frameworks
Inference
#cuda #attention-mechanism #deep-learning

PaddlePaddle/Paddle.js

Paddle.js is a web project for the Baidu PaddlePaddle deep learning framework, enabling browser-based inference.

1.1K
Archived
JavaScript
Inference
Frontend Frameworks
React
#deep-learning #inference-engine #paddlepaddle