Inference

Explore 351 open source projects in Inference

Showing 221-240 of 351 projects

microsoft/onnxruntime-inference-examples

Examples for using ONNX Runtime for machine learning inferencing.

1.6K
Active
C++
Inference
#machine-learning#onnx#inference

mit-han-lab/smoothquant

SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.

1.6K
Archived
Python
LLM Frameworks
Inference
Python
#quantization#large-language-models#performance-optimization

PaddlePaddle/PaddleSlim

PaddleSlim is an open-source library for deep model compression and architecture search.

1.6K
Active
Python
Inference
ML Ops
PyTorch
#compression#architecture-search#model-optimization

divamgupta/stable-diffusion-tensorflow

A TensorFlow/Keras implementation of the Stable Diffusion AI model for text-to-image generation.

1.6K
Archived
Python
Computer Vision
Inference
TensorFlow
#stable-diffusion#text-to-image#computer-vision

dotnet/infer

Infer.NET is a C# framework for running Bayesian inference in graphical models, useful for machine learning.

1.6K
Stable
C#
Inference
API Frameworks
#bayesian-inference#machine-learning#graphical-models

google-research/parti

An AI-powered text-to-image generation model for creating high-quality images from text prompts.

1.6K
Archived
LLM Frameworks
Computer Vision
Tensorflow
#text-to-image#image-generation#machine-learning

TsinghuaAI/CPM-1-Generate

A pre-trained Chinese language model for text generation, useful for AI-powered coding and content creation.

1.6K
Archived
Python
LLM Frameworks
Inference
Python
#text-generation#chinese-language-model#pre-trained-model

dfm/emcee

An ensemble sampling toolkit for affine-invariant Markov Chain Monte Carlo (MCMC) in Python.

1.6K
Active
Python
Inference
Caching
#mcmc#probabilistic-modeling#data-analysis

google/uncertainty-baselines

High-quality implementations of standard and state-of-the-art methods for Bayesian and probabilistic machine learning.

1.6K
Active
Python
ML Ops
Fine-tuning
TensorFlow
#bayesian-methods#probabilistic-programming#data-science

HIPS/Spearmint

Spearmint is a Bayesian optimization codebase for AI and machine learning experiments.

1.6K
Archived
Python
Agents & Orchestration
Inference
Python
#bayesian-optimization#hyperparameter-tuning#machine-learning

RWKV/rwkv.cpp

An efficient C++ implementation of the RWKV language model for fast CPU inference on various bit-width quantizations.

1.6K
Experimental
C++
LLM Frameworks
Inference
#language-model#llm#quantization

thu-ml/prolificdreamer

A high-performance text-to-3D generation model for building immersive 3D experiences with AI tools.

1.6K
Archived
Python
Text-to-Image & Text-to-3D
Fine-tuning
PyTorch
#text-to-3d#diffusion-model#stablediffusion

SonyResearch/micro_diffusion

Official repository for work on micro-budget training of large-scale diffusion models for AI coding tools.

1.5K
Archived
Python
Inference
AI Code Generation
Python
#diffusion-models#machine-learning#code-generation

Tencent/TurboTransformers

A fast and user-friendly runtime for running transformer models like BERT, GPT-2, and others on CPU and GPU.

1.5K
Experimental
C++
LLM Frameworks
Inference
PyTorch
#bert#gpt2#transformer

XueZeyue/DanceGRPO

An official implementation of DanceGRPO, a visual generation model that leverages GRPO techniques.

1.5K
Stable
Python
Computer Vision
Inference
Python
#computer-vision#generative-ai#visual-generation

google-research/big_transfer

Official repository for the 'Big Transfer (BiT): General Visual Representation Learning' paper, focused on transfer learning for computer vision.

1.5K
Archived
Python
Computer Vision
Inference
PyTorch
#computer-vision#transfer-learning#representation-learning

AnswerDotAI/fsdp_qlora

A repository for training large language models (LLMs) using QLoRA and FSDP techniques.

1.5K
Archived
Jupyter Notebook
LLM Frameworks
Fine-tuning
PyTorch
#llm#fine-tuning#inference

mlcommons/inference

A reference implementation of MLPerf inference benchmarks for machine learning performance testing.

1.5K
Active
Python
Inference
Testing
Python
#benchmark#machine-learning#performance-testing

google-ai-edge/LiteRT

LiteRT is Google's on-device framework for high-performance ML & GenAI deployment on edge platforms.

1.5K
Active
C++
Inference
ML Ops
C++
#machine-learning#edge-computing#inference

o19s/elasticsearch-learning-to-rank

Plugin to integrate Learning to Rank (machine learning for better relevance) with Elasticsearch

1.5K
Stable
Java
Inference
Search
#elasticsearch#machine-learning#search-relevance
1...1113...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.