Showing 21-40 of 130 projects
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
Open-source voice synthesis studio powered by Qwen3-TTS
A lightweight, state-of-the-art text-to-speech (TTS) model for developers building AI-powered applications.
Spark-TTS is an open-source Python library for high-quality text-to-speech inference.
A platform that simplifies the use of cutting-edge AI technologies for developers through GUI tools.
A fast, local neural text-to-speech system for developers building voice-enabled applications.
A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.
A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.
KrillinAI is a video translation and dubbing tool powered by LLMs, offering 100 language translations and one-click deployment.
A multi-purpose platform for developers to discover and use various reading and media resources, including AI-powered tools.
Qwen3-TTS is an open-source series of TTS models that enable stable, expressive, and streaming speech generation.
A sound cloning tool that lets you use your voice or any sound to record audio, with a web interface.
Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.
EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.
An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.
A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.
A simple native web interface for ChatTTS text-to-speech synthesis with API support.
Open-source platform for building low-latency vision AI agents using any model or video provider
High-quality multi-lingual text-to-speech library supporting English, Spanish, French, Chinese, Japanese and Korean.
Zonos is an open-source, high-quality text-to-speech model for developers building AI-powered applications.
Get weekly updates on trending AI coding tools and projects.