Optimized attention mechanism for deep learning.
Qwen is a large language model series by Alibaba Cloud with multiple variants and capabilities.
LeetCUDA is a comprehensive collection of modern CUDA learning resources, including 200+ CUDA kernels, Tensor Cores, HGEMM, and FA-2 MMA.
An open-source Chinese version of the LLaMA and Alpaca language models, with 64K long-context support for advanced NLP applications.
Official release of the InternLM series of large language models focused on building AI tools and chatbots.
A curated list of awesome papers and code for optimizing LLM/VLM inference performance.