Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 141-160 of 321 projects

pytorch/TensorRT

PyTorch compiler for NVIDIA GPUs using TensorRT, enabling efficient deep learning inference on CUDA hardware.

3.0K
Active
Python
ML Ops
API Frameworks
PyTorch
#deep-learning#cuda#nvidia

evilsocket/cake

A distributed inference engine for large language models and StableDiffusion on mobile, desktop and server.

3.0K
Archived
Rust
LLM Frameworks
Inference
#llm#stable-diffusion#inference

vllm-project/vllm-omni

A Python framework for efficient model inference with omni-modality AI models.

2.9K
Active
Python
Inference
Multimodal
PyTorch
#audio-generation#diffusion#image-generation

alexzhang13/rlm

A general plug-and-play inference library for Recursive Language Models (RLMs) supporting various sandboxes.

2.9K
Active
Python
LLM Frameworks
React
#inference#recursive language models#sandbox

ruvnet/ruvector

High-performance vector graph neural network database in Rust for real-time AI inference and graph ML.

2.9K
Active
Rust
Inference
Vector Databases
Rust
#vector-database#gnn#graph-neural-networks

NVIDIA/MinkowskiEngine

A high-performance, auto-diff neural network library for 3D and 4D sparse tensor computations.

2.9K
Archived
Python
Computer Vision
ML Ops
PyTorch
#3d-convolutional-network#4d-convolutional-neural-network#sparse-tensor-network

b4rtaz/distributed-llama

A C++ library for distributed large language model inference, allowing developers to build powerful AI applications with a cluster of home devices.

2.8K
Active
C++
LLM Frameworks
Containerization
#distributed-computing#llm-inference#llama2

spiceai/spiceai

A portable accelerated SQL query, search, and LLM-inference engine for data-grounded AI apps and agents.

2.8K
Active
Rust
LLM Frameworks
Databases
Rust
#artificial-intelligence#data-federation#full-text-search

sonos/tract

A lightweight, self-contained Rust library for running Tensorflow and ONNX models with no dependencies

2.8K
Active
Rust
Inference
API Frameworks
#tensorflow#onnx#inference

janhq/cortex.cpp

A C++ library for building local AI inference platforms with support for ONNX models.

2.8K
Experimental
C++
LLM Frameworks
API Clients & Testing
#onnx#onnxruntime#llm

stan-dev/stan

An open-source C++ library for Bayesian inference and data analysis using Markov Chain Monte Carlo (MCMC) methods.

2.7K
Active
C++
Bayesian
API Frameworks
#bayesian-inference#statistical-modeling#mcmc

FasterDecoding/Medusa

A simple framework for accelerating LLM generation with multiple decoding heads

2.7K
Archived
Jupyter Notebook
LLM Frameworks
Inference
Jupyter Notebook
#llm#generation#inference

airockchip/rknn-toolkit2

This repository provides an AI-powered toolkit for developing applications with the Rockchip RKNN inference engine.

2.7K
Experimental
C
Inference
API Frameworks
#rknn#rockchip#ai-inference

mahyarnajibi/SNIPER

An efficient multi-scale object detection training and inference algorithm for computer vision tasks.

2.7K
Archived
Python
Computer Vision
Python
#computer-vision#object-detection#deep-learning

facebookresearch/sam-3d-body

This repository provides code and models for running inference with the SAM 3D Body Model, a tool for 3D body reconstruction.

2.7K
Stable
Python
Computer Vision
Inference
Python
#3d-body#computer-vision#inference

autodistill/autodistill

An AI-powered tool for training supervised models without manual labeling, using foundation models and multimodal learning.

2.6K
Experimental
Python
Computer Vision
Model Distillation
PyTorch
#auto-labeling#computer-vision#foundation-models

containers/ramalama

RamaLama simplifies local serving of AI models and enables their use for inference in production via containers.

2.6K
Active
Python
LLM Frameworks
Inference
Python
#ai#containers#inference-server

pyro-ppl/numpyro

A probabilistic programming library powered by NumPy and JAX for Bayesian inference and MCMC sampling.

2.6K
Active
Python
Inference
Caching
NumPy
#bayesian-inference#hmc#jax

llm-d/llm-d

Achieve state-of-the-art inference performance on modern accelerators with this Kubernetes-based solution.

2.6K
Active
Shell
Inference
Containerization
#kubernetes#inference#performance

xdit-project/xDiT

A scalable inference engine for diffusion transformers with massive parallelism

2.6K
Active
Python
React
#authentication#streaming#inference
1...79...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.