Explore Projects

Discover 228 open-source projects

Active filters (1):
Search: voice

Showing 21-40 of 228 projects

DrewThomasson/ebook2audiobook

Converts e-books to audiobooks using AI voice cloning and supports over 1158 languages.

18.4K
Active
Python
React
#audiobook #voice-cloning #tts

livekit/livekit

An end-to-end realtime stack for connecting humans and AI, built with Go and WebRTC.

17.4K
Active
Go
Realtime
#realtime #webrtc #media-server

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source #virtual-assistant #speech-to-text

Huanshere/VideoLingo

A fully automated AI video subtitling pipeline with one-click subtitle cutting, translation, alignment, and dubbing.

16.1K
Experimental
Python
AI Video & Image
Python
#ai-translation #dubbing #localization

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

neonbjb/tortoise-tts

A multi-voice text-to-speech (TTS) system with a focus on high-quality audio output.

14.8K
Archived
Jupyter Notebook
AI Voice & Speech
#text-to-speech #multi-voice #audio-quality

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition #voice-recognition #offline

slopus/happy

A full-featured mobile and web client for Codex and Claude Code, with realtime voice and encryption.

14.3K
Active
TypeScript
AI Code Editors
React
#claude-code #codex #realtime

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition #speech-synthesis #speech-translation

idootop/mi-gpt

An open-source project that connects the Xiaomi AI speaker to ChatGPT and Doubao, turning it into a custom voice assistant.

12.2K
Stable
TypeScript
LLM Wrappers & SDKs
TypeScript
#voice-assistant #chatgpt #ai-speaker

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K
Active
TypeScript
Voice AI & Synthesis
Whisper
#qwen3-tts #voice-ai #mlx

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition #audio-processing #deep-learning

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K
Stable
C++
AI Voice & Speech
#text-to-speech #tts #speech-synthesis

pipecat-ai/pipecat

An open-source framework for building voice and multimodal conversational AI applications.

10.6K
Active
Python
LLM Frameworks
Python
#conversational-ai #voice-assistant #multimodal

TEN-framework/ten-framework

Open-source framework for building conversational voice AI agents

10.2K
Active
Python
React
#authentication #real-time #type-safe

RunanywhereAI/runanywhere-sdks

Production-ready AI toolkit for local AI inference

10.2K
Active
Kotlin
AI Coding Tools
#agent-framework #android #apple-intelligence

noisetorch/NoiseTorch

A real-time microphone noise suppression tool for Linux developers, built with Go.

10.1K
Archived
Go
Noise Reduction
#linux #noise-reduction #noise-suppression

kyutai-labs/moshi

Moshi is an open-source speech-text foundation model and full-duplex spoken dialogue framework for building AI-powered voice apps.

9.8K
Active
Python
LLM Frameworks
Python
#speech-to-text #dialogue-framework #audio-codec

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition #speech-synthesis #speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation #speech-synthesis #text-to-speech
