Showing 61-80 of 173 projects
Olares is an open-source personal cloud platform to reclaim your data and enable local AI computing.
A unified and scalable ML library for large-scale distributed training, model serving, and federated learning.
LightLLM is a high-performance, scalable Python-based framework for inference and serving of large language models.
A course on building a tiny vLLM (virtual Large Language Model) and Qwen inference serving on Apple Silicon for systems engineers.
Embed files into a Go executable, making it easy to serve static content with Go.
A minimal Python framework for building custom AI inference servers with full control over logic, batching, and scaling.
Undertow is a high-performance, non-blocking web server written in Java, suitable for serving large-scale web apps.
A comprehensive collection of resources and tutorials for building AI infrastructure and systems.
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
High-performance Inference and Deployment Toolkit for LLMs and VLMs
A compact Python implementation of SGLang to demystify modern LLM serving systems.
Mellow is a rule-based global transparent proxy client for Windows, macOS and Linux, serving as a Proxifier alternative.
Laravel GraphQL framework for building scalable and efficient APIs
High-performance inference framework for large language models, focusing on efficiency, flexibility, and availability.
A simple, cross-platform HTTP server written in Rust for serving static files.
A gateway and registry for the Model Context Protocol (MCP) that enables interoperability between AI tools and LLM applications.
The CDN for everything on npm, providing a simple way to serve npm packages in the browser
A browser compilation library that serves as an asset pipeline for applications running in the browser.
An open-source robotics platform that leverages smartphones as the brain for low-cost robot bodies.
Alpa is a distributed training and serving framework for large-scale neural networks with auto-parallelization.
Get weekly updates on trending AI coding tools and projects.