Showing 101-120 of 130 projects
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.
Matcha-TTS is a fast and efficient text-to-speech (TTS) architecture using a conditional flow matching approach.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
An all-in-one web UI for different audio-related neural networks, including text-to-speech, voice cloning, and generative music.
Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!
Real-time voice interactive digital human with customizable appearance and voice, supporting voice cloning.
A library of descriptions for C++17 features, presented in a format called 'Tony Tables'.
A Chinese text-to-speech engine supporting Cantonese, Tibetan and other languages.
Soprano is a Python library that provides ultra-realistic text-to-speech capabilities.
Free and open-source text-to-speech software built with Vue.js and Electron.
A Java-based framework for building AI-powered productivity tools, including chatbots, drawing, knowledge management, and more.
An open-source text-to-speech library built using Transformer-based neural networks for high-quality speech synthesis.
A TensorFlow-based text-to-speech model for vibe coders interested in AI-powered voice applications.
This GitHub repository is a open-source TTS (Text-to-Speech) tracking tool for developers.
A Java-based enterprise-level management platform for the Xiaozhi ESP32 device, providing device monitoring, audio customization, role switching, and conversation history management.
An open-source speech interaction system that integrates ASR, LLM, and TTS models for voice-based applications.
A Chrome extension that enables real-time translation of any language across various web content including PDF, ebooks, and more.
A C++ inference library for various SVC/TTS models, including DiffSinger, DiffSVC, HiFiGAN, and VITS.
Offline Russian voice assistant with plugin-based skills for developers working with AI tools.
A real-time text-to-speech model that can stream conversational audio in real-time.
Get weekly updates on trending AI coding tools and projects.