Showing 1-5 of 5 projects
ML Systems textbook and hands-on learning stack for building real-world AI systems
Efficient implementations of state-of-the-art linear attention models for large language models and NLP tasks.
A flexible and fast reinforcement learning library for building LLM-powered agents and reasoning systems.
A comprehensive survey of efficient large language models for AI and machine learning systems.
Efficient implementations of Native Sparse Attention, a key component in large language models.
Get weekly updates on trending AI coding tools and projects.