Inference

Explore 351 open source projects in Inference

Showing 101-120 of 351 projects

open-edge-platform/anomalib

An open-source anomaly detection library with state-of-the-art algorithms and features like experiment management and edge inference.

5.4K
Active
Python
Computer Vision
Inference
Python
#anomaly-detection#anomaly-localization#anomaly-segmentation

NVIDIA/tacotron2

A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.

5.3K
Archived
Jupyter Notebook
Speech & Audio
Inference
PyTorch
#text-to-speech#audio-generation#machine-learning

flashinfer-ai/flashinfer

A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.

5.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#llm#inference#cuda

xlite-dev/Awesome-LLM-Inference

A curated list of awesome papers and code for optimizing LLM/VLM inference performance

5.0K
Active
Python
LLM Frameworks
LLM Wrappers & SDKs
#llm#inference#optimization

NVlabs/Sana

SANA is an efficient high-resolution image synthesis library using a linear diffusion transformer.

5.0K
Active
Python
Computer Vision
Inference
PyTorch
#diffusion#text-to-image-generation#transformers

h2oai/h2o-llmstudio

H2O LLM Studio is a no-code GUI framework for fine-tuning large language models (LLMs) like GPT-3, LLAMA, and ChatGPT.

4.9K
Active
Python
LLM Frameworks
Fine-tuning
Python
#ai#chatbot#chatgpt

NVIDIA-AI-IOT/torch2trt

An easy-to-use PyTorch to TensorRT converter for optimizing AI model inference on NVIDIA Jetson devices.

4.9K
Archived
Python
Inference
API Frameworks
PyTorch
#pytorch#tensorrt#jetson

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K
Active
Python
LLM Frameworks
Fine-tuning
Python
#chatgpt#gpt#llama

thunlp/OpenPrompt

An open-source framework for prompt-learning, a powerful technique for fine-tuning language models.

4.8K
Archived
Python
LLM Frameworks
Fine-tuning
PyTorch
#prompt-learning#language-models#natural-language-processing

facebookincubator/AITemplate

AITemplate is a Python framework for rendering neural networks into high-performance CUDA/HIP C++ code, optimized for GPU inference.

4.7K
Active
Python
Inference
ML Ops
Python
#cuda#hip#c++

gpustack/gpustack

Optimize AI inference performance on GPUs with this Python library for selecting and tuning inference engines.

4.6K
Active
Python
Inference
CLI Tools
Python
#ai-inference#gpu-acceleration#performance-optimization

huggingface/text-embeddings-inference

A blazing fast inference solution for text embeddings models built with Rust.

4.6K
Active
Rust
LLM Frameworks
Inference
#embeddings#inference#text-processing

lightvector/KataGo

KataGo is an open-source Go engine and self-play learning platform for AI research and development.

4.5K
Active
C++
Agents & Orchestration
Inference
#go#ai#reinforcement-learning

huawei-noah/Efficient-AI-Backbones

Efficient AI model backbones developed by Huawei's Noah's Ark Lab, including GhostNet, TNT, and MLP.

4.4K
Experimental
Python
Computer Vision
Model Compression
PyTorch
#convolutional-neural-networks#efficient-inference#ghostnet

xlite-dev/lite.ai.toolkit

A lite C++ AI toolkit with 100+ models for computer vision tasks like detection, segmentation, and image generation.

4.4K
Active
C++
Computer Vision
Inference
#computer-vision#inference#cli

showlab/Tune-A-Video

Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.

4.4K
Archived
Python
Computer Vision
Fine-tuning
Python
#text-to-video#diffusion-models#fine-tuning

openvinotoolkit/open_model_zoo

A collection of pre-trained deep learning models and demos optimized for high performance using the OpenVINO toolkit.

4.4K
Active
Python
Inference
ML Ops
PyTorch
#deep-learning#model-zoo#openvino

OpenNMT/CTranslate2

Fast C++ inference engine for Transformer models, supporting CUDA, MKL, and other optimizations.

4.3K
Active
C++
Inference
API Frameworks
#deep-learning#machine-translation#neural-machine-translation

VectorSpaceLab/OmniGen

OmniGen is a unified image generation library that supports diffusion models, multi-modal and multi-task learning.

4.3K
Stable
Jupyter Notebook
Computer Vision
Image & Video
Jupyter Notebook
#diffusion#image-generation#multi-modal

Tencent-Hunyuan/HunyuanDiT

A powerful multi-resolution diffusion transformer for fine-grained Chinese text understanding and generation.

4.3K
Stable
Jupyter Notebook
LLM Frameworks
Fine-tuning
#ai-text-generation#chinese-language-model#multi-resolution-diffusion
1...57...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.