Showing 21-22 of 22 projects
Pure C inference engine for Mistral Voxtral 4B speech-to-text model with minimal dependencies
Run billion-parameter LLMs on embedded devices with extreme quantization for edge inference
Get weekly updates on trending AI coding tools and projects.