Explore Projects

Discover 321 open source projects

Active filters (1):
Search: inferenceร—
Clear all

Showing 121-140 of 321 projects

0hq/WebGPT

Run GPT model on the browser with WebGPU, a lightweight JavaScript implementation.

3.8K
Archived
JavaScript
AI Code Generation
Next.js
#GPT#WebGPU#JavaScript

yuanzhoulvpi2017/zero_nlp

A Chinese NLP solution with large models, data, training, and inference capabilities for developers.

3.8K
Stable
Jupyter Notebook
LLM Frameworks
API Frameworks
PyTorch
#bert#chatglm-6b#gpt

Michael-A-Kuykendall/shimmy

A free, open-source Rust inference server compatible with OpenAI-API, suitable for vibe coders

3.7K
Active
Rust
React
#authentication#inference-server#open-source

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

3.7K
Experimental
Python
LLM Frameworks
BaaS Platforms
PyTorch
#llm#fine-tuning#model-serving

PaddlePaddle/FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs

3.7K
Active
Python
PaddlePaddle
#inference#deployment#LLMs

mit-han-lab/llm-awq

A library for efficient weight quantization of large language models to accelerate inference on edge devices.

3.5K
Experimental
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#llm#compression#acceleration

thu-pacman/chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

3.4K
Active
Python
LLM Frameworks
API Frameworks
PyTorch
#llm#inference#gpu

gluon-lang/gluon

Gluon is a static, type-inferred and embeddable programming language written in Rust for building compilers and language tooling.

3.4K
Archived
Rust
Compilers
API Frameworks
Rust
#compiler#embeddable#functional

facebookresearch/sam-audio

Provides code for running inference with the Meta Segment Anything Audio Model (SAM-Audio) and example notebooks.

3.4K
Active
Python
LLM Frameworks
React
#audio-modeling#inference#meta-segment

Lagrange-Labs/deep-prove

A Rust framework for blazingly fast inference of ML models using zero-knowledge proofs.

3.3K
Active
Rust
Inference
API Frameworks
#ai#ml#zk-snarks

matheusfacure/python-causality-handbook

A light-hearted yet rigorous approach to learning about impact estimation and causality in Python.

3.3K
Stable
Jupyter Notebook
Causal Inference
Data Science
Python
#causal-inference#econometrics#data-science

rguo12/awesome-causality-algorithms

An index of algorithms for learning causality with data, useful for vibe coders working on AI-powered applications.

3.2K
Archived
ML Ops
API Frameworks
#causality#causal-inference#recommender-system

Soul-AILab/SoulX-Podcast

An open-source codebase for generating high-fidelity podcasts from text using AI models.

3.2K
Stable
Python
Inference
Audio & Speech
Python
#audio-generation#text-to-speech#podcast-generation

thu-ml/SageAttention

Quantized attention that achieves 2-5x speedup over FlashAttention for language, image, and video models.

3.2K
Active
Cuda
Inference
Quantization
PyTorch
#attention#efficient-attention#inference-acceleration

NVIDIA/TransformerEngine

A high-performance Transformer library for accelerating AI models on NVIDIA GPUs, including low-precision support.

3.2K
Active
Python
LLM Frameworks
Inference
PyTorch
#deep-learning#gpu#cuda

guandeh17/Self-Forcing

Self Forcing: Bridging Training and Inference in Autoregressive Video Diffusion

3.2K
Stable
Python
LLM Frameworks
None
React
#autoregressive video diffusion#self forcing#neurips 2025 spotlight

pgmpy/pgmpy

Python library for Causal AI and Bayesian networks

3.2K
Active
Python
React
#causal-inference#bayesian-networks#probabilistic-inference

neuralmagic/deepsparse

A sparsity-aware deep learning inference runtime for CPUs, optimized for performance and efficiency.

3.2K
Experimental
Python
Inference
API Frameworks
PyTorch
#computer-vision#nlp#object-detection

mkocabas/VIBE

Official implementation of a CVPR2020 paper for video-based 3D human pose and shape estimation.

3.2K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#3d-human-pose#3d-pose-estimation#computer-vision

hao-ai-lab/FastVideo

A unified inference and post-training framework for accelerated video generation powered by AI.

3.1K
Active
Python
Computer Vision
Inference
PyTorch
#video-generation#diffusion-models#distillation
1...68...17

Stay in the loop

Get weekly updates on trending AI coding tools and projects.