Inference

Explore 351 open source projects in Inference

Showing 141-160 of 351 projects

onnx/onnx-tensorrt

ONNX-TensorRT is a C++ library that provides a TensorRT backend for the ONNX deep learning framework.

3.2K
Stable
C++
Inference
API Frameworks
#deep-learning#nvidia#onnx

NVIDIA/TransformerEngine

A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.

3.2K
Active
Python
LLM Frameworks
Inference
PyTorch
#deep-learning#gpu#cuda

schrodingercatss/tuning_playbook_zh_cn

A tactical manual to systematically maximize the performance of your deep learning models.

3.2K
Archived
Fine-tuning
Inference
#deep-learning#model-optimization#performance-tuning

neuralmagic/deepsparse

A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.

3.2K
Experimental
Python
Inference
API Frameworks
PyTorch
#computer-vision#nlp#object-detection

hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation powered by AI.

3.1K
Active
Python
Computer Vision
Inference
PyTorch
#video-generation#diffusion-models#distillation

ryankiros/neural-storyteller

A recurrent neural network for generating little stories about images.

3.0K
Archived
Python
Computer Vision
LLM Frameworks
Python
#machine-learning#natural-language-processing#image-captioning

evilsocket/cake

A distributed inference engine for large language models and StableDiffusion on mobile, desktop and server.

3.0K
Archived
Rust
LLM Frameworks
Inference
#llm#stable-diffusion#inference

vllm-project/vllm-omni

A Python framework for efficient model inference with omni-modality AI models.

2.9K
Active
Python
Inference
Multimodal
PyTorch
#audio-generation#diffusion#image-generation

ruvnet/ruvector

High-performance vector graph neural network database in Rust for real-time AI inference and graph ML.

2.9K
Active
Rust
Inference
Vector Databases
Rust
#vector-database#gnn#graph-neural-networks

sonos/tract

A lightweight, self-contained Rust library for running Tensorflow and ONNX models with no dependencies

2.8K
Active
Rust
Inference
API Frameworks
#tensorflow#onnx#inference

rom1504/clip-retrieval

Easily compute CLIP embeddings and build a CLIP-based retrieval system with this Jupyter Notebook library.

2.7K
Stable
Jupyter Notebook
Computer Vision
Inference
Jupyter Notebook
#clip#retrieval#computer-vision

FasterDecoding/Medusa

A simple framework for accelerating LLM generation with multiple decoding heads

2.7K
Archived
Jupyter Notebook
LLM Frameworks
Inference
Jupyter Notebook
#llm#generation#inference

airockchip/rknn-toolkit2

This repository provides an AI-powered toolkit for developing applications with the Rockchip RKNN inference engine.

2.7K
Experimental
C
Inference
API Frameworks
#rknn#rockchip#ai-inference

facebookresearch/sam-3d-body

This repository provides code and models for running inference with the SAM 3D Body Model, a tool for 3D body reconstruction.

2.7K
Stable
Python
Computer Vision
Inference
Python
#3d-body#computer-vision#inference

autodistill/autodistill

An AI-powered tool for training supervised models without manual labeling, using foundation models and multimodal learning.

2.6K
Experimental
Python
Computer Vision
Model Distillation
PyTorch
#auto-labeling#computer-vision#foundation-models

containers/ramalama

RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.

2.6K
Active
Python
LLM Frameworks
Inference
Python
#ai#containers#inference-server

pyro-ppl/numpyro

A probabilistic programming library powered by NumPy and JAX for Bayesian inference and MCMC sampling.

2.6K
Active
Python
Inference
Caching
NumPy
#bayesian-inference#hmc#jax

llm-d/llm-d

Achieve state-of-the-art inference performance on modern accelerators with this Kubernetes-based solution.

2.6K
Active
Shell
Inference
Containerization
#kubernetes#inference#performance

znxlwm/UGATIT-pytorch

Official PyTorch implementation of U-GAT-IT, an unsupervised generative adversarial network for image-to-image translation.

2.5K
Archived
Python
Computer Vision
Inference
PyTorch
#computer-vision#image-translation#generative-adversarial-network

exacity/simplified-deeplearning

Simplified implementations of deep learning related works for developers interested in AI and machine learning.

2.5K
Archived
Jupyter Notebook
ML Ops
Inference
#deep-learning#machine-learning#tutorial
1...79...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.