Showing 1-6 of 6 projects
High-performance serving framework for large language and multimodal models
Mooncake is a serving platform for Kimi, a leading LLM service provided by Moonshot AI, focused on disaggregation, inference, and RDMA.
Optimize AI inference performance on GPUs with this Python library for selecting and tuning inference engines.
A compact Python implementation of SGLang to demystify modern LLM serving systems.
LLMs-based Operators and Pipelines for data prep
An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.
Get weekly updates on trending AI coding tools and projects.