Explore Projects

Discover 54 open source projects

Active filters (1):
Search: speaker

Showing 1-20 of 54 projects

coqui-ai/TTS

๐ŸธTTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K
Archived
Python
AI Voice & Speech
PyTorch
#text-to-speech#deep-learning#speech-synthesis

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K
Active
Python
React
#generative-ai#machine-learning#neural-networks

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition#speaker-identification#speaker-verification

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition#voice-recognition#offline

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-translation

idootop/mi-gpt

An open-source project that connects the Xiaomi AI speaker to ChatGPT and Doubao, turning it into a custom voice assistant.

12.2K
Stable
TypeScript
LLM Wrappers & SDKs
TypeScript
#voice-assistant#chatgpt#ai-speaker

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K
Active
Rust
LLM Frameworks
#ai-meeting-assistant#transcription#speaker-diarization

mozilla/TTS

A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.

10.1K
Archived
Jupyter Notebook
AI Voice & Speech
PyTorch
#text-to-speech#speech-generation#deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

hanxi/xiaomusic

An open-source Python project that allows playing music through a Xiaomi AI speaker using yt-dlp for downloading.

9.4K
Active
Python
API Frameworks
Backend Frameworks
Vue
#music#xiaoai#xiaomusic

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech#multi-speaker#emotion

wzpan/wukong-robot

An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.

7.1K
Archived
Python
AI Voice & Speech
Agents & Orchestration
Python
#chatgpt#voice-assistant#brain-computer-interface

yihong0618/xiaogpt

A Python library that allows developers to interact with ChatGPT and other large language models using a Xiaomi AI speaker.

6.8K
Stable
Python
LLM Frameworks
API Clients & Testing
Python
#chatgpt#llm#xiaomi-ai-speaker

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K
Archived
Python
AI Voice & Speech
TensorFlow
#speech-synthesis#text-to-speech#tts

modelscope/ClearerVoice-Studio

An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.

4.0K
Stable
Python
AI Voice & Speech
PyTorch
#speech-enhancement#speech-separation#speaker-extraction
