Showing 1-15 of 15 projects
๐ธTTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.
Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.
Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.
A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.
Modular, open-source platform for building voice-based AI agents and LLM integrations.
A high-quality open-source PyTorch implementation of the WaveNet vocoder, a neural network for speech synthesis.
A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.
Unofficial Parallel WaveGAN repository with PyTorch implementation of various neural vocoders for speech synthesis.
A high-quality speech analysis, manipulation and synthesis system written in C++.
A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.
Official PyTorch implementation of BigVGAN, a neural vocoder for generating high-quality audio, music, and speech.
A PyTorch-based library for zero-shot voice style transfer using only autoencoder loss.
Vocos is a high-quality audio synthesis library that bridges the gap between time-domain and Fourier-based neural vocoders.
Get weekly updates on trending AI coding tools and projects.