Showing 161-180 of 228 projects
Realtime AI voice agents with state-of-the-art multimodal AI models for AI toys, companions, and devices.
A Facebook Messenger Bot with voice recognition, NLP, and features like restaurant search and memo transcription.
A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.
A neural network model for detecting different emotions from audio speeches using Python and deep learning.
A Flutter-based Android/iOS voice chat app built on the Xiaozhi chatbot server.
A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.
A Python-based project that provides a TTS API server and Gradio-based web UI for speech synthesis and voice generation.
A Ruby library for interacting with the Twilio API and generating TwiML for voice and SMS applications.
ComfyUI integration for Microsoft's VibeVoice text-to-speech model
A web component wrapper for the Web Speech API, enabling voice recognition and speech synthesis.
An open-source Chinese voice assistant project that runs on Raspberry Pi
A simple local voice assistant powered by Whisper and large language models.
An open-source Alexa client for building voice-enabled applications.
macOS offline speech-to-text app using local ML—no cloud, fully private voice dictation
This open-source Python library is a toolkit for building speech synthesis and voice conversion systems using deep learning.
A PyTorch implementation of a voice separation algorithm for mixed audio with multiple speakers.
Talon is a Python library for building voice interfaces and voice-driven applications.
Open-source API platform offering various services like Docker, IP, QR code, and more for developers.
A Neovim AI plugin that enables ChatGPT sessions, Instructable text/code operations, and Speech to Text functionality.
Get weekly updates on trending AI coding tools and projects.