Showing 61-80 of 124 projects
A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.
A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.
A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.
FastSpeech 2 implementation for high-quality end-to-end text-to-speech
A PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models.
An open-source text-to-speech tool supporting long-form text and multi-voice narration.
A lightweight, open-source translator that supports multiple translation engines and features like OCR and text-to-speech.
A multi-functional OCR tool for text recognition, translation, text-to-speech, manga translation, and more.
A developer-focused platform for text-to-speech, RAG, and LLMs, with local-first architecture.
A TensorFlow-based end-to-end text-to-speech synthesis model for vibe coders working on AI-powered applications.
The Alan AI SDK for Android provides a conversational AI platform for building voice assistants and chatbots.
The Alan AI SDK for Flutter enables building conversational AI-powered apps and voice interfaces.
A free and open-source speech synthesizer for Russian and other languages, supporting various platforms.
Real-time typing translation software with voice-to-text and text-to-speech capabilities for League of Legends players.
Automatically translate and dub videos using AI-powered text-to-speech and subtitle synchronization.
A self-coding system for Ionic apps using AI-powered chatbot and voice assistant SDK.
A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.
Unofficial Parallel WaveGAN repository with PyTorch implementation of various neural vocoders for speech synthesis.
Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.
A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.
Get weekly updates on trending AI coding tools and projects.