Inference

Explore 351 open source projects in Inference

Showing 181-200 of 351 projects

FoundationVision/LlamaGen

A scalable image generation model based on the Llama language model, outperforming diffusion models.

1.9K
Archived
Python
LLM Frameworks
Computer Vision
Python
#auto-regressive-model#diffusion-models#image-generation

withcatai/node-llama-cpp

Run AI models like LLaMA locally on your machine with Node.js bindings for llama.cpp and enforce JSON schema on the output.

1.9K
Active
TypeScript
LLM Wrappers & SDKs
Inference
Node.js
#ai#llama#llama-cpp

vortexgpgpu/vortex

This is a Verilog-based GPGPU hardware project for accelerating AI/ML workloads.

1.9K
Active
Verilog
Inference
Embedded
#gpu#verilog#ai-acceleration

meta-pytorch/opacus

A library for training PyTorch models with differential privacy, enabling privacy-preserving machine learning.

1.9K
Stable
Jupyter Notebook
Machine Learning Ops
Inference
PyTorch
#differential-privacy#machine-learning#neural-network

uTensor/uTensor

uTensor is a TinyML AI inference library for microcontrollers and edge devices, enabling embedded AI applications.

1.9K
Experimental
C++
Inference
API Frameworks
#tinyml#embedded-ai#microcontrollers

Yuanshi9815/OminiControl

OminiControl is a minimal and universal control system for diffusion transformer models like DALL-E and Stable Diffusion.

1.9K
Experimental
Python
LLM Frameworks
Inference
Python
#diffusion-models#computer-vision#image-generation

PixArt-alpha/PixArt-sigma

A diffusion transformer model for generating high-quality 4K text-to-image art, focused on vibe coders and AI developers.

1.9K
Archived
Python
LLM Frameworks
Computer Vision
Python
#text-to-image#diffusion-model#transformer

d8ahazard/sd_dreambooth_extension

A Python extension for the Stable Diffusion AI model, focused on the DreamBooth fine-tuning technique.

1.9K
Stable
Python
Fine-tuning
Inference
Python
#stable-diffusion#dreambooth#fine-tuning

tobegit3hub/tensorflow_template_application

A TensorFlow template application for building deep learning models and deploying them to production.

1.9K
Archived
Python
Inference
ML Ops
TensorFlow
#deep-learning#machine-learning#inference

antirez/iris.c

Flux 2 is a pure C inference engine for an image generation model, useful for vibe coders working with AI tools.

1.9K
Active
C
Inference
#image-generation#ai-model#inference

black-forest-labs/flux2

Official inference repo for FLUX.2 models, a library for building AI-powered applications.

1.9K
Active
Python
LLM Frameworks
Inference
Python
#ai-models#inference#llm

filipstrand/mflux

A Python library for state-of-the-art generative image models, focused on AI and machine learning tools for vibe coders.

1.9K
Active
Python
Inference
Computer Vision
Python
#diffusers#huggingface#mlx

ray-project/llm-applications

A comprehensive guide for building production-ready RAG-based LLM applications using the Ray framework.

1.9K
Archived
Jupyter Notebook
LLM Frameworks
RAG & Vector
Jupyter Notebook
#llms#ray#fine-tuning

THUDM/LongWriter

LongWriter is a fine-tuned large language model (LLM) that can generate high-quality long-form text of 10,000+ words from long-form context.

1.8K
Experimental
Python
LLM Frameworks
Fine-tuning
Python
#llm#long-context#long-text

microsoft/onnxjs

ONNX.js allows developers to run ONNX machine learning models using JavaScript in the browser or Node.js.

1.8K
Archived
TypeScript
Inference
Backend Frameworks
TypeScript
#machine-learning#inference#onnx

EmpireMediaScience/A1111-Web-UI-Installer

A complete installer for Automatic1111's popular Stable Diffusion WebUI, a tool for AI-powered image generation.

1.8K
Archived
PowerShell
Inference
Full-Stack Frameworks
#stable-diffusion#ai-image-generation#web-ui

Fictiverse/Redream

A C# library that provides a realtime interface to the Automatic1111 Stable Diffusion API for AI-powered image generation.

1.8K
Archived
C#
Agents & Orchestration
Computer Vision
#ai-image-generation#stable-diffusion#realtime

OpenPPL/ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

1.8K
Archived
Python
Inference
API Frameworks
PyTorch
#neural-network#quantization#deep-learning

xororz/local-dream

A Kotlin library that runs Stable Diffusion on Android devices with Snapdragon NPU acceleration.

1.8K
Stable
Kotlin
Inference
Android
Android
#stable-diffusion#android-ai#image-generation

webonnx/wonnx

A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web.

1.7K
Archived
Rust
Inference
Frontend Frameworks
Rust
#onnx#webassembly#webgpu
1...911...18

Stay in the loop

Get weekly updates on trending AI coding tools and projects.