Inference

Explore 351 open source projects in Inference

Showing 181-200 of 351 projects

FoundationVision/LlamaGen

An autoregressive image generation model that applies the Llama language model architecture to images, reported to outperform diffusion models.

1.9K

Archived

Python

LLM Frameworks

Computer Vision

Python

#auto-regressive-model#diffusion-models#image-generation

withcatai/node-llama-cpp

Run AI models like LLaMA locally on your machine with Node.js bindings for llama.cpp and enforce JSON schema on the output.

1.9K

Active

TypeScript

LLM Wrappers & SDKs

Inference

Node.js

#ai#llama#llama-cpp

vortexgpgpu/vortex

An open-source RISC-V GPGPU implemented in Verilog, aimed at accelerating AI/ML workloads.

1.9K

Active

Verilog

Inference

Embedded

#gpu#verilog#ai-acceleration

meta-pytorch/opacus

A library for training PyTorch models with differential privacy, enabling privacy-preserving machine learning.

1.9K

Stable

Jupyter Notebook

Machine Learning Ops

Inference

PyTorch

#differential-privacy#machine-learning#neural-network

uTensor/uTensor

uTensor is a TinyML AI inference library for microcontrollers and edge devices, enabling embedded AI applications.

1.9K

Experimental

C++

Inference

API Frameworks

#tinyml#embedded-ai#microcontrollers

Yuanshi9815/OminiControl

OminiControl is a minimal and universal control framework for diffusion transformer (DiT) models such as FLUX.

1.9K

Experimental

Python

LLM Frameworks

Inference

Python

#diffusion-models#computer-vision#image-generation

PixArt-alpha/PixArt-sigma

A diffusion transformer model for high-quality 4K text-to-image generation.

1.9K

Archived

Python

LLM Frameworks

Computer Vision

Python

#text-to-image#diffusion-model#transformer

d8ahazard/sd_dreambooth_extension

A Python extension for the Stable Diffusion WebUI that adds DreamBooth fine-tuning.

1.9K

Stable

Python

Fine-tuning

Inference

Python

#stable-diffusion#dreambooth#fine-tuning

tobegit3hub/tensorflow_template_application

A TensorFlow template application for building deep learning models and deploying them to production.

1.9K

Archived

Python

Inference

ML Ops

TensorFlow

#deep-learning#machine-learning#inference

antirez/iris.c

A pure C inference engine for the FLUX.2 image generation models.

1.9K

Active

Inference

#image-generation#ai-model#inference

black-forest-labs/flux2

Official inference repository for the FLUX.2 image generation models.

1.9K

Active

Python

LLM Frameworks

Inference

Python

#ai-models#inference#llm

filipstrand/mflux

A Python library for running state-of-the-art generative image models with MLX.

1.9K

Active

Python

Inference

Computer Vision

Python

#diffusers#huggingface#mlx

ray-project/llm-applications

A comprehensive guide for building production-ready RAG-based LLM applications using the Ray framework.

1.9K

Archived

Jupyter Notebook

LLM Frameworks

RAG & Vector

Jupyter Notebook

#llms#ray#fine-tuning

THUDM/LongWriter

LongWriter is a fine-tuned long-context large language model (LLM) that can generate high-quality long-form text of 10,000+ words.

1.8K

Experimental

Python

LLM Frameworks

Fine-tuning

Python

#llm#long-context#long-text

microsoft/onnxjs

ONNX.js allows developers to run ONNX machine learning models using JavaScript in the browser or Node.js.

1.8K

Archived

TypeScript

Inference

Backend Frameworks

TypeScript

#machine-learning#inference#onnx

EmpireMediaScience/A1111-Web-UI-Installer

A complete installer for Automatic1111's popular Stable Diffusion WebUI, a tool for AI-powered image generation.

1.8K

Archived

PowerShell

Inference

Full-Stack Frameworks

#stable-diffusion#ai-image-generation#web-ui

Fictiverse/Redream

A C# library that provides a realtime interface to the Automatic1111 Stable Diffusion API for AI-powered image generation.

1.8K

Archived

Agents & Orchestration

Computer Vision

#ai-image-generation#stable-diffusion#realtime

OpenPPL/ppq

PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.

1.8K

Archived

Python

Inference

API Frameworks

PyTorch

#neural-network#quantization#deep-learning

xororz/local-dream

An Android app, written in Kotlin, that runs Stable Diffusion on-device with Snapdragon NPU acceleration.

1.8K

Stable

Kotlin

Inference

Android

#stable-diffusion#android-ai#image-generation

webonnx/wonnx

A WebGPU-accelerated ONNX inference run-time written 100% in Rust, ready for native and the web.

1.7K

Archived

Rust

Inference

Frontend Frameworks

Rust

#onnx#webassembly#webgpu
