Showing 1-2 of 2 projects
DeepSpeed optimizes deep learning training and inference with distributed computing techniques.
Run billion-parameter LLMs on embedded devices with extreme quantization for edge inference
Get weekly updates on trending AI coding tools and projects.