Explore Projects

Discover 321 open source projects

Active filters (1):

Search: inference×

Clear all

Showing 41-60 of 321 projects

NVIDIA/TensorRT

NVIDIA TensorRT is an SDK for high-performance deep learning inference on NVIDIA GPUs.

12.7K

Active

C++

Inference

#deep-learning#gpu-acceleration#inference

DayBreak-u/chineseocr_lite

A lightweight Chinese OCR library that supports vertical text recognition and NCNN/MNN/TNN inference with a small model size.

12.3K

Archived

C++

Computer Vision

PyTorch

#ocr#computer-vision#ncnn

bentoml/OpenLLM

Deploy open-source LLMs as OpenAI-compatible API endpoints using BentoML's model serving framework.

12.1K

Active

Python

AI Model Serving

Local Inference Engines

BentoML

#llm-inference#bentoml#model-serving

GeeeekExplorer/nano-vllm

A powerful Python library for working with large language models (LLMs) and natural language processing tasks.

12.0K

Stable

Python

LLM Frameworks

PyTorch

#llm#nlp#deep-learning

ubicloud/ubicloud

Open-source cloud platform with elastic compute, storage, databases, AI, and Kubernetes services.

11.9K

Active

Ruby

Managed Cloud Platforms

Ruby

#cloud#open-source#kubernetes

SparkAudio/Spark-TTS

Spark-TTS is an open-source Python library for high-quality text-to-speech inference.

10.9K

Experimental

Python

AI Voice & Speech

#text-to-speech#inference#open-source

aws/amazon-sagemaker-examples

A collection of Jupyter notebooks showcasing how to build and deploy machine learning models with Amazon SageMaker.

10.9K

Active

Jupyter Notebook

ML Ops

Jupyter Notebook

#machine-learning#deep-learning#data-science

huggingface/text-generation-inference

Large Language Model Text Generation Inference library for developers working with AI tools and models.

10.8K

Active

Python

LLM Frameworks

PyTorch

#nlp#transformer#bloom

mistralai/mistral-inference

Official inference library for Mistral models, a platform for building AI-powered applications.

10.7K

Stable

Jupyter Notebook

LLM Frameworks

React

#llm#llm-inference#mistralai

triton-inference-server/server

An optimized cloud and edge inference solution for deploying and running machine learning models.

10.4K

Active

Python

Inference

#machine-learning#deep-learning#gpu

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K

Archived

C++

Inference

#speech-recognition#whisper#asr

RunanywhereAI/runanywhere-sdks

Production ready AI toolkit for local AI inference

10.2K

Active

Kotlin

AI Coding Tools

#agent-framework#android#apple-intelligence

bigscience-workshop/petals

A distributed system for running large language models (LLMs) on personal devices, enabling faster fine-tuning and inference.

10.0K

Archived

Python

LLM Frameworks

PyTorch

#llm#distributed-computing#fine-tuning

openvinotoolkit/openvino

OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.

9.8K

Active

C++

Inference

#ai#computer-vision#deep-learning

deepseek-ai/3FS

A high-performance distributed file system designed for AI workloads like training and inference.

9.7K

Active

C++

Infrastructure

#distributed-file-system#ai-workloads#high-performance

pymc-devs/pymc

A powerful Bayesian modeling and probabilistic programming library for Python developers.

9.5K

Active

Python

LLM Frameworks

Python

#bayesian-inference#mcmc#probabilistic-programming

xorbitsai/inference

Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.

9.1K

Active

Python

LLM Frameworks

Inference

PyTorch

#artificial-intelligence#llm#inference

pyro-ppl/pyro

A deep universal probabilistic programming library for Python and PyTorch, enabling Bayesian machine learning.

9.0K

Experimental

Python

LLM Frameworks

API Frameworks

PyTorch

#bayesian#probabilistic-modeling#variational-inference

oumi-ai/oumi

Easily fine-tune, evaluate and deploy open-source large language models like GPT-OSS and Llama.

8.9K

Active

Python

LLM Frameworks

Inference

Python

#llms#fine-tuning#evaluation

Tiiny-AI/PowerInfer

High-performance C++ library for fast local deployment of large language models (LLMs) like LLaMA.

8.8K

Active

C++

LLM Frameworks

API Frameworks

#llm#llm-inference#local-inference

1 24...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.