Showing 1-5 of 5 projects
End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.
DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.
An open-source successor to UTAU, a platform for singing voice synthesis and audio production.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
Official PyTorch implementation of BigVGAN, a neural vocoder for generating high-quality audio, music, and speech.
Get weekly updates on trending AI coding tools and projects.