Showing 1-4 of 4 projects
DeepSpeed optimizes deep learning training and inference with distributed computing techniques.
Colossal-AI optimizes large AI model training and inference with distributed computing and GPU acceleration.
A fast, portable data-parallel computation language for image processing and performance optimization.
A high-performance C++ SIMD vector library for data-parallel computing across various CPU architectures.
Get weekly updates on trending AI coding tools and projects.