Explore Projects

Discover 228 open source projects

Active filters (1):

Search: voice×

Clear all

Showing 21-40 of 228 projects

DrewThomasson/ebook2audiobook

Converts e-books to audiobooks using AI voice cloning and supports over 1158 languages.

18.4K

Active

Python

React

#audiobook#voice-cloning#tts

livekit/livekit

An end-to-end realtime stack for connecting humans and AI, built with Go and WebRTC.

17.4K

Active

Realtime

#realtime#webrtc#media-server

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K

Active

TypeScript

AI Voice & Speech

Node

#open-source#virtual-assistant#speech-to-text

Huanshere/VideoLingo

Fully automated AI video subtitle team with one-click subtitle cutting, translation, alignment, and dubbing.

16.1K

Experimental

Python

AI Video & Image

Python

#ai-translation#dubbing#localization

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

neonbjb/tortoise-tts

A multi-voice text-to-speech (TTS) system with a focus on high-quality audio output.

14.8K

Archived

Jupyter Notebook

AI Voice & Speech

#text-to-speech#multi-voice#audio-quality

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition#voice-recognition#offline

slopus/happy

A mobile and web client for Codex and Claude Code, with realtime voice, encryption, and full-featured functionality.

14.3K

Active

TypeScript

AI Code Editors

React

#claude-code#codex#realtime

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

idootop/mi-gpt

An open-source project that connects the Xiaomi AI speaker to ChatGPT and Douban, turning it into a custom voice assistant.

12.2K

Stable

TypeScript

LLM Wrappers & SDKs

TypeScript

#voice-assistant#chatgpt#ai-speaker

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K

Active

TypeScript

Voice AI & Synthesis

Whisper

#qwen3-tts#voice-ai#mlx

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition#audio-processing#deep-learning

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K

Stable

C++

AI Voice & Speech

#text-to-speech#tts#speech-synthesis

pipecat-ai/pipecat

An open-source framework for building voice and multimodal conversational AI applications.

10.6K

Active

Python

LLM Frameworks

Python

#conversational-ai#voice-assistant#multimodal

TEN-framework/ten-framework

Open-source framework for building conversational voice AI agents

10.2K

Active

Python

React

#authentication#real-time#type-safe

RunanywhereAI/runanywhere-sdks

Production ready AI toolkit for local AI inference

10.2K

Active

Kotlin

AI Coding Tools

#agent-framework#android#apple-intelligence

noisetorch/NoiseTorch

A real-time microphone noise suppression tool for Linux developers, built with Go.

10.1K

Archived

Noise Reduction

#linux#noise-reduction#noise-suppression

kyutai-labs/moshi

Moshi is an open-source speech-to-text foundation model and dialogue framework for building AI-powered voice apps.

9.8K

Active

Python

LLM Frameworks

Python

#speech-to-text#dialogue-framework#audio-codec

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K

Experimental

Python

AI Audio & Speech

Python

#audio-generation#speech-synthesis#text-to-speech

13...12

Stay in the loop

Get weekly updates on trending AI coding tools and projects.