Explore Projects

Discover 228 open source projects

Active filters (1):
Search: voice

Showing 41-60 of 228 projects

livekit/agents

A powerful Python framework for building real-time voice AI agents powered by OpenAI and other AI tools.

9.6K
Active
Python
Agents & Orchestration
#agents #ai #openai

KoljaB/RealtimeSTT

A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.

9.5K
Experimental
Python
AI Voice & Speech
#realtime #speech-to-text #voice-activity-detection

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K
Active
Python
AI Voice & Speech
PyTorch
#voice-conversion #speech-synthesis #realtime

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition #speaker-diarization #audio-processing

benawad/dogehouse

An open-source voice chat application built with TypeScript, Elixir, and React.

9.1K
Archived
TypeScript
Component Libraries (React)
Realtime
React
#voice-chat #real-time #open-source

QwenLM/Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models that enable stable, expressive, and streaming speech generation.

9.0K
Active
Python
AI Voice & Speech
AI App Builders
Python
#tts #text-to-speech #speech-generation

jianchang512/clone-voice

A sound cloning tool with a web interface that lets you record audio using your own voice or any other sound.

8.9K
Stable
Python
AI Voice & Speech
Frontend Frameworks
React
#clonevoice #speech-analysis #tts

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech #multi-speaker #emotion

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K
Stable
Python
AI Voice & Speech
PyTorch
#speech-processing #voice-activity-detection #voice-commands

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K
Archived
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#emotional-speech #text-to-speech #transformer-architecture

mumble-voip/mumble

Mumble is open-source, low-latency, high-quality voice chat software for gaming and communication.

7.8K
Active
C++
Realtime
Full-Stack Frameworks
CMake
#voip #voice-chat #open-source

feross/simple-peer

Simple WebRTC library for creating peer-to-peer data, video, and voice channels in the browser and Node.js.

7.8K
Archived
JavaScript
Frontend Frameworks
API Frameworks
React
#webrtc #p2p #data-channels

fonoster/fonoster

An open-source alternative to Twilio for cloud communications and programmable voice/telephony.

7.7K
Active
TypeScript
BaaS Platforms
API Clients & Testing
TypeScript
#cloud-communications #programmable-voice #telephony

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-recognition #speech-emotion-recognition #audio-event-classification

smacke/ffsubsync

Automatically synchronize subtitles with video using audio alignment and speech detection.

7.6K
Stable
Python
AI Audio & Speech
API Frameworks
#audio-alignment #speech-detection #subtitle-synchronization

GetStream/Vision-Agents

Open-source platform for building low-latency vision AI agents using any model or video provider.

7.3K
Active
Python
Agents & Orchestration
Computer Vision
Python
#agentic-ai #vision-ai #video-ai

wzpan/wukong-robot

An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.

7.1K
Archived
Python
AI Voice & Speech
Agents & Orchestration
Python
#chatgpt #voice-assistant #brain-computer-interface

enricoros/big-AGI

AI suite with advanced AI/AGI functions, including personas, multi-model chats, text-to-image, voice, and more.

6.9K
Active
TypeScript
LLM Frameworks
Agents & Orchestration
TypeScript
#agi #ai-agents #ai-suite

TalAter/annyang

Speech recognition library for your web application, enabling voice interactions.

6.7K
Archived
JavaScript
Component Libraries (React)
AI Voice & Speech
React
#speech #speech-recognition #voice

MycroftAI/mycroft-core

Mycroft Core is an open-source AI platform for building voice assistants and smart home integrations.

6.6K
Archived
Python
AI Voice & Speech
API Frameworks
Python
#ai-assistant #voice-interface #open-source
