Showing 41-60 of 228 projects
A powerful Python framework for building real-time voice AI agents powered by OpenAI and other AI tools.
A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.
A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.
A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.
An open-source voice chat application built with TypeScript, Elixir, and React.
Qwen3-TTS is an open-source series of TTS models that enable stable, expressive, and streaming speech generation.
A sound cloning tool that lets you use your voice or any sound to record audio, with a web interface.
EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.
Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.
An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.
Mumble is an open-source, low-latency, high-quality voice chat software for gaming and communication.
Simple WebRTC library for creating peer-to-peer data, video, and voice channels in the browser and Node.js.
An open-source alternative to Twilio for cloud communications and programmable voice/telephony.
A multilingual voice understanding model for AI-powered audio analysis and transcription.
Automagically synchronize subtitles with video using audio alignment and speech detection.
Open-source platform for building low-latency vision AI agents using any model or video provider
An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.
AI suite with advanced AI/AGI functions, including personas, multi-model chats, text-to-image, voice, and more.
Speech recognition library for your web application, enabling voice interactions.
Mycroft Core is an open-source AI platform for building voice assistants and smart home integrations.
Get weekly updates on trending AI coding tools and projects.