Inference

Explore 351 open-source projects in Inference

Showing 101-120 of 351 projects

open-edge-platform/anomalib

An open-source anomaly detection library with state-of-the-art algorithms and features like experiment management and edge inference.

5.4K

Active

Python

Computer Vision

Inference

Python

#anomaly-detection #anomaly-localization #anomaly-segmentation

NVIDIA/tacotron2

A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.

5.3K

Archived

Jupyter Notebook

Speech & Audio

Inference

PyTorch

#text-to-speech #audio-generation #machine-learning

flashinfer-ai/flashinfer

A Python library for serving large language models (LLMs) with high performance, including GPU acceleration and distributed inference.

5.1K

Active

Python

LLM Frameworks

Inference

PyTorch

#llm #inference #cuda

xlite-dev/Awesome-LLM-Inference

A curated list of papers and code for optimizing LLM/VLM inference performance.

5.0K

Active

Python

LLM Frameworks

LLM Wrappers & SDKs

#llm #inference #optimization

NVlabs/Sana

SANA is an efficient high-resolution image synthesis library using a linear diffusion transformer.

5.0K

Active

Python

Computer Vision

Inference

PyTorch

#diffusion #text-to-image-generation #transformers

h2oai/h2o-llmstudio

H2O LLM Studio is a no-code GUI framework for fine-tuning large language models (LLMs) like GPT-3, LLAMA, and ChatGPT.

4.9K

Active

Python

LLM Frameworks

Fine-tuning

Python

#ai #chatbot #chatgpt

NVIDIA-AI-IOT/torch2trt

An easy-to-use PyTorch to TensorRT converter for optimizing AI model inference on NVIDIA Jetson devices.

4.9K

Archived

Python

Inference

API Frameworks

PyTorch

#pytorch #tensorrt #jetson

shibing624/MedicalGPT

This repository allows developers to train their own medical language models using the ChatGPT training pipeline.

4.8K

Active

Python

LLM Frameworks

Fine-tuning

Python

#chatgpt #gpt #llama

thunlp/OpenPrompt

An open-source framework for prompt-learning, a powerful technique for fine-tuning language models.

4.8K

Archived

Python

LLM Frameworks

Fine-tuning

PyTorch

#prompt-learning #language-models #natural-language-processing

facebookincubator/AITemplate

AITemplate is a Python framework for rendering neural networks into high-performance CUDA/HIP C++ code, optimized for GPU inference.

4.7K

Active

Python

Inference

ML Ops

Python

#cuda #hip #c++

gpustack/gpustack

Optimize AI inference performance on GPUs with this Python library for selecting and tuning inference engines.

4.6K

Active

Python

Inference

CLI Tools

Python

#ai-inference #gpu-acceleration #performance-optimization

huggingface/text-embeddings-inference

A fast inference solution for text embedding models, built with Rust.

4.6K

Active

Rust

LLM Frameworks

Inference

#embeddings #inference #text-processing

lightvector/KataGo

KataGo is an open-source Go engine and self-play learning platform for AI research and development.

4.5K

Active

C++

Agents & Orchestration

Inference

#go #ai #reinforcement-learning

huawei-noah/Efficient-AI-Backbones

Efficient AI model backbones developed by Huawei's Noah's Ark Lab, including GhostNet, TNT, and MLP.

4.4K

Experimental

Python

Computer Vision

Model Compression

PyTorch

#convolutional-neural-networks #efficient-inference #ghostnet

xlite-dev/lite.ai.toolkit

A lite C++ AI toolkit with 100+ models for computer vision tasks like detection, segmentation, and image generation.

4.4K

Active

C++

Computer Vision

Inference

#computer-vision #inference #cli

showlab/Tune-A-Video

Tune-A-Video is a one-shot text-to-video generation tool that fine-tunes image diffusion models.

4.4K

Archived

Python

Computer Vision

Fine-tuning

Python

#text-to-video #diffusion-models #fine-tuning

openvinotoolkit/open_model_zoo

A collection of pre-trained deep learning models and demos optimized for high performance using the OpenVINO toolkit.

4.4K

Active

Python

Inference

ML Ops

PyTorch

#deep-learning #model-zoo #openvino

OpenNMT/CTranslate2

Fast C++ inference engine for Transformer models, supporting CUDA, MKL, and other optimizations.

4.3K

Active

C++

Inference

API Frameworks

#deep-learning #machine-translation #neural-machine-translation

VectorSpaceLab/OmniGen

OmniGen is a unified image generation library that supports diffusion models, multi-modal and multi-task learning.

4.3K

Stable

Jupyter Notebook

Computer Vision

Image & Video

Jupyter Notebook

#diffusion #image-generation #multi-modal

Tencent-Hunyuan/HunyuanDiT

A multi-resolution diffusion transformer for text-to-image generation with fine-grained Chinese understanding.

4.3K

Stable

Jupyter Notebook

LLM Frameworks

Fine-tuning

#ai-text-generation #chinese-language-model #multi-resolution-diffusion
