Explore Projects

Discover 173 open source projects

Active filters (1):
Search: servingร—
Clear all

Showing 61-80 of 173 projects

beclab/Olares

Olares is an open-source personal cloud platform to reclaim your data and enable local AI computing.

4.2K
Active
Go
MCP Servers
Agents & Orchestration
Go
#ai-privacy#edge-ai#home-automation

FedML-AI/FedML

A unified and scalable ML library for large-scale distributed training, model serving, and federated learning.

4.0K
Stable
Python
ML Ops
Inference
React
#ai#machine-learning#federated-learning

ModelTC/LightLLM

LightLLM is a high-performance, scalable Python-based framework for inference and serving of large language models.

3.9K
Active
Python
LLM Frameworks
API Frameworks
#llm#model-serving#deep-learning

skyzh/tiny-llm

A course on building a tiny vLLM (virtual Large Language Model) and Qwen inference serving on Apple Silicon for systems engineers.

3.9K
Stable
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#llm#qwen#vllm

rakyll/statik

Embed files into a Go executable, making it easy to serve static content with Go.

3.8K
Active
Go
API Frameworks
Backend Frameworks
#go#golang#http

Lightning-AI/LitServe

A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.

3.8K
Active
Python
AI Model Serving
API Frameworks
FastAPI
#ai#inference#serving

undertow-io/undertow

Undertow is a high-performance, non-blocking web server written in Java, suitable for serving large-scale web apps.

3.7K
Active
Java
API Frameworks
Backend Frameworks
Jakarta EE
#http#servlet#websocket

HuaizhengZhang/AI-Infra-from-Zero-to-Hero

A comprehensive collection of resources and tutorials for building AI infrastructure and systems.

3.7K
Experimental
LLM Frameworks
ML Ops
#ai-infrastructure#machine-learning#llm

predibase/lorax

Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs

3.7K
Experimental
Python
LLM Frameworks
BaaS Platforms
PyTorch
#llm#fine-tuning#model-serving

PaddlePaddle/FastDeploy

High-performance Inference and Deployment Toolkit for LLMs and VLMs

3.7K
Active
Python
PaddlePaddle
#inference#deployment#LLMs

sgl-project/mini-sglang

A compact Python implementation of SGLang to demystify modern LLM serving systems.

3.6K
Active
Python
LLM Frameworks
LLM Wrappers & SDKs
#llm#language-model#serving

mellow-io/mellow

Mellow is a rule-based global transparent proxy client for Windows, macOS and Linux, serving as a Proxifier alternative.

3.6K
Archived
JavaScript
Authentication
CLI Tools
#proxy#vpn#networking

nuwave/lighthouse

Laravel GraphQL framework for building scalable and efficient APIs

3.5K
Active
PHP
GraphQL
Authentication
React
#authentication#graphql#laravel

thu-pacman/chitu

High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.

3.4K
Active
Python
LLM Frameworks
API Frameworks
PyTorch
#llm#inference#gpu

TheWaWaR/simple-http-server

A simple, cross-platform HTTP server written in Rust for serving static files.

3.4K
Stable
Rust
Backend Frameworks
CLI Tools
#http#server#static-file-serving

IBM/mcp-context-forge

A gateway and registry for the Model Context Protocol (MCP) that enables interoperability between AI tools and LLM applications.

3.4K
Active
Python
MCP Servers
LLM Frameworks
FastAPI
#api-gateway#model-context-protocol#llm-agents

unpkg/unpkg

The CDN for everything on npm, providing a simple way to serve npm packages in the browser

3.4K
Experimental
TypeScript
Backend Frameworks
CLI Tools
React
#cdn#npm#browser

broccolijs/broccoli

A browser compilation library that serves as an asset pipeline for applications running in the browser.

3.3K
Stable
JavaScript
Component Libraries (React)
Frontend Frameworks
React
#asset-pipeline#browser-compilation#build-tool

ob-f/OpenBot

An open-source robotics platform that leverages smartphones as the brain for low-cost robot bodies.

3.2K
Experimental
Swift
Robotics
Example Projects
Android
#robotics#smartphone#android

alpa-projects/alpa

Alpa is a distributed training and serving framework for large-scale neural networks with auto-parallelization.

3.2K
Archived
Python
LLM Frameworks
API Frameworks
JAX
#distributed-computing#high-performance-computing#auto-parallelization
1...35...9

Stay in the loop

Get weekly updates on trending AI coding tools and projects.