🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation, supporting 1100+ languages, with tools for training and fine-tuning models.
An open-source personal assistant that provides AI-powered voice and text interactions.
A scalable generative AI framework for researchers and developers.
A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
A fast, local neural text-to-speech system for developers building voice-enabled applications.
A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.
End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.
Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.
A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.
EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.
A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.
Pre-trained text-to-speech models for various languages, made simple to use.
DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.
Open-source and modular AI-powered speech-to-speech translation tool built with Python.
A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.
A Python library that provides real-time text-to-speech conversion capabilities for developers.
An open-source successor to UTAU, a platform for singing voice synthesis and audio production.