Explore Projects

Discover 54 open source projects

Active filters (1):

Search: speaker×

Showing 1-20 of 54 projects

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K

Archived

Python

AI Voice & Speech

PyTorch

#text-to-speech#deep-learning#speech-synthesis

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K

Active

Python

React

#generative-ai#machine-learning#neural-networks

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K

Stable

Shell

Speech Recognition

#speech-recognition#speaker-identification#speaker-verification

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition#voice-recognition#offline

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

idootop/mi-gpt

An open-source project that connects the Xiaomi AI speaker to ChatGPT and Douban, turning it into a custom voice assistant.

12.2K

Stable

TypeScript

LLM Wrappers & SDKs

TypeScript

#voice-assistant#chatgpt#ai-speaker

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition#audio-processing#deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text#text-to-speech#offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K

Active

Rust

LLM Frameworks

#ai-meeting-assistant#transcription#speaker-diarization

mozilla/TTS

A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.

10.1K

Archived

Jupyter Notebook

AI Voice & Speech

PyTorch

#text-to-speech#speech-generation#deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

hanxi/xiaomusic

An open-source Python project that allows playing music through a Xiaomi AI speaker using yt-dlp for downloading.

9.4K

Active

Python

API Frameworks

Backend Frameworks

Vue

#music#xiaoai#xiaomusic

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition#speaker-diarization#audio-processing

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K

Archived

Python

AI Voice & Speech

Machine Learning Ops

PyTorch

#text-to-speech#multi-speaker#emotion

wzpan/wukong-robot

An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.

7.1K

Archived

Python

AI Voice & Speech

Agents & Orchestration

Python

#chatgpt#voice-assistant#brain-computer-interface

yihong0618/xiaogpt

A Python library that allows developers to interact with ChatGPT and other large language models using a Xiaomi AI speaker.

6.8K

Stable

Python

LLM Frameworks

API Clients & Testing

Python

#chatgpt#llm#xiaomi-ai-speaker

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr#speaker-diarization#speech-recognition

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K

Archived

Python

AI Voice & Speech

TensorFlow

#speech-synthesis#text-to-speech#tts

modelscope/ClearerVoice-Studio

An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.

4.0K

Stable

Python

AI Voice & Speech

PyTorch

#speech-enhancement#speech-separation#speaker-extraction

2 3

Stay in the loop

Get weekly updates on trending AI coding tools and projects.