Showing 41-60 of 124 projects
A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.
A nearly-live implementation of OpenAI's Whisper, a powerful speech recognition and translation tool.
A Python library that provides real-time text-to-speech conversion capabilities for developers.
A lightweight, fast, and efficient text-to-speech library for developers who need to add voice functionality to their projects.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
Fast local neural text-to-speech engine for offline voice synthesis
An open-source text-to-speech software that enables high-quality, free-to-use voice generation.
A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.
A versatile WebUI for various AI-powered text-to-speech engines, enabling vibe coders to explore and utilize cutting-edge audio generation tools.
An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.
Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.
A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).
A fast, on-device, multilingual text-to-speech (TTS) library running natively via ONNX.
A Python library and CLI tool to interface with Google Translate's text-to-speech API.
A Python-based tool that enables easy deployment of ChatTTS, supporting features like streaming output, voice selection, and multi-character reading.
An open-source, multilingual text-to-speech synthesis system written in pure Java.
Offline Text-to-Speech library for Python developers to add speech synthesis to their applications.
Executable file for VITS inference, a neural text-to-speech model for generating high-quality speech.
An open-source Python library for building text-to-speech (TTS) applications using the Kokoro engine and ONNX runtime.
Tacotron-2 is a state-of-the-art text-to-speech model that vibe coders can use to build speech synthesis applications.
Get weekly updates on trending AI coding tools and projects.