Explore Projects

Discover 321 open source projects

Active filters (1):

Search: inference×

Clear all

Showing 141-160 of 321 projects

pytorch/TensorRT

PyTorch compiler for NVIDIA GPUs using TensorRT, enabling efficient deep learning inference on CUDA hardware.

3.0K

Active

Python

ML Ops

API Frameworks

PyTorch

#deep-learning#cuda#nvidia

evilsocket/cake

A distributed inference engine for large language models and StableDiffusion on mobile, desktop and server.

3.0K

Archived

Rust

LLM Frameworks

Inference

#llm#stable-diffusion#inference

vllm-project/vllm-omni

A Python framework for efficient model inference with omni-modality AI models.

2.9K

Active

Python

Inference

Multimodal

PyTorch

#audio-generation#diffusion#image-generation

alexzhang13/rlm

A general plug-and-play inference library for Recursive Language Models (RLMs) supporting various sandboxes.

2.9K

Active

Python

LLM Frameworks

React

#inference#recursive language models#sandbox

ruvnet/ruvector

High-performance vector graph neural network database in Rust for real-time AI inference and graph ML.

2.9K

Active

Rust

Inference

Vector Databases

Rust

#vector-database#gnn#graph-neural-networks

NVIDIA/MinkowskiEngine

A high-performance, auto-diff neural network library for 3D and 4D sparse tensor computations.

2.9K

Archived

Python

Computer Vision

ML Ops

PyTorch

#3d-convolutional-network#4d-convolutional-neural-network#sparse-tensor-network

b4rtaz/distributed-llama

A C++ library for distributed large language model inference, allowing developers to build powerful AI applications with a cluster of home devices.

2.8K

Active

C++

LLM Frameworks

Containerization

#distributed-computing#llm-inference#llama2

spiceai/spiceai

A portable accelerated SQL query, search, and LLM-inference engine for data-grounded AI apps and agents.

2.8K

Active

Rust

LLM Frameworks

Databases

Rust

#artificial-intelligence#data-federation#full-text-search

sonos/tract

A lightweight, self-contained Rust library for running Tensorflow and ONNX models with no dependencies

2.8K

Active

Rust

Inference

API Frameworks

#tensorflow#onnx#inference

janhq/cortex.cpp

A C++ library for building local AI inference platforms with support for ONNX models.

2.8K

Experimental

C++

LLM Frameworks

API Clients & Testing

#onnx#onnxruntime#llm

stan-dev/stan

An open-source C++ library for Bayesian inference and data analysis using Markov Chain Monte Carlo (MCMC) methods.

2.7K

Active

C++

Bayesian

API Frameworks

#bayesian-inference#statistical-modeling#mcmc

FasterDecoding/Medusa

A simple framework for accelerating LLM generation with multiple decoding heads

2.7K

Archived

Jupyter Notebook

LLM Frameworks

Inference

Jupyter Notebook

#llm#generation#inference

airockchip/rknn-toolkit2

This repository provides an AI-powered toolkit for developing applications with the Rockchip RKNN inference engine.

2.7K

Experimental

Inference

API Frameworks

#rknn#rockchip#ai-inference

mahyarnajibi/SNIPER

An efficient multi-scale object detection training and inference algorithm for computer vision tasks.

2.7K

Archived

Python

Computer Vision

Python

#computer-vision#object-detection#deep-learning

facebookresearch/sam-3d-body

This repository provides code and models for running inference with the SAM 3D Body Model, a tool for 3D body reconstruction.

2.7K

Stable

Python

Computer Vision

Inference

Python

#3d-body#computer-vision#inference

autodistill/autodistill

An AI-powered tool for training supervised models without manual labeling, using foundation models and multimodal learning.

2.6K

Experimental

Python

Computer Vision

Model Distillation

PyTorch

#auto-labeling#computer-vision#foundation-models

containers/ramalama

RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.

2.6K

Active

Python

LLM Frameworks

Inference

Python

#ai#containers#inference-server

pyro-ppl/numpyro

A probabilistic programming library powered by NumPy and JAX for Bayesian inference and MCMC sampling.

2.6K

Active

Python

Inference

Caching

NumPy

#bayesian-inference#hmc#jax

llm-d/llm-d

Achieve state-of-the-art inference performance on modern accelerators with this Kubernetes-based solution.

2.6K

Active

Shell

Inference

Containerization

#kubernetes#inference#performance

xdit-project/xDiT

A scalable inference engine for diffusion transformers with massive parallelism

2.6K

Active

Python

React

#authentication#streaming#inference

1...79...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.