Showing 1-20 of 124 projects
Few-shot voice cloning and TTS with 1 min training data
Fine-tuning & RL for LLMs with optimized performance and memory use
๐ธTTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.
Generates natural speech for dialogue scenarios
Voice cloning tool for real-time speech generation
Instant voice cloning model with tone color cloning and multi-lingual support
Multilingual voice generation model with full-stack capabilities for TTS, training, and deployment
An ultra-realistic text-to-speech model for generating natural-sounding dialogue and audio.
An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.
An open-source personal assistant that provides AI-powered voice and text interactions.
A scalable generative AI framework for researchers and developers
A Python library that translates videos from one language to another, with support for dubbing and subtitles.
A multi-voice text-to-speech (TTS) system with a focus on high-quality audio output.
Official code for a text-to-speech model that generates fluent and faithful speech with flow matching.
A lightweight, state-of-the-art text-to-speech (TTS) model for developers building AI-powered applications.
Spark-TTS is an open-source Python library for high-quality text-to-speech inference.
A fast, local neural text-to-speech system for developers building voice-enabled applications.
An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.
A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.
A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.
Get weekly updates on trending AI coding tools and projects.