FishAudio-S1 is a high-quality open-source TTS model with voice cloning capabilities.
Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.
An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.
An unofficial PyTorch implementation of VALL-E, an audio language model for text-to-speech.
A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.