Explore Projects

Discover 51 open source projects

Active filters (1):
Search: speech-synthesisร—
Clear all

Showing 1-20 of 51 projects

coqui-ai/TTS

๐ŸธTTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K
Archived
Python
AI Voice & Speech
PyTorch
#text-to-speech#deep-learning#speech-synthesis

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source#virtual-assistant#speech-to-text

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K
Active
Python
React
#generative-ai#machine-learning#neural-networks

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K
Archived
Jupyter Notebook
ML Ops
PyTorch
#deep-learning#computer-vision#natural-language-processing

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-translation

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K
Stable
C++
AI Voice & Speech
#text-to-speech#tts#speech-synthesis

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K
Stable
Python
AI Voice & Speech
#text-to-speech#speech-synthesis#microsoft-edge

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation#speech-synthesis#text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K
Active
Python
AI Voice & Speech
PyTorch
#voice-conversion#speech-synthesis#realtime

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech#multi-speaker#emotion

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#deep-learning#speech-synthesis

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#voice-cloning#speech-synthesis

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K
Active
Jupyter Notebook
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#pre-trained-models

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#singing-synthesis#text-to-speech#diffusion-model

huggingface/speech-to-speech

Open-source and modular AI-powered speech-to-speech translation tool built with Python.

4.5K
Experimental
Python
Speech & Voice
API Frameworks
Python
#speech-recognition#speech-synthesis#speech-translation

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K
Archived
Python
AI Voice & Speech
TensorFlow
#speech-synthesis#text-to-speech#tts

KoljaB/RealtimeTTS

A Python library that provides real-time text-to-speech conversion capabilities for developers.

3.8K
Active
Python
AI Voice & Speech
#realtime#speech-synthesis#text-to-speech

stakira/OpenUtau

An open-source successor to UTAU, a platform for singing voice synthesis and audio production.

3.6K
Active
C#
AI Voice & Speech
Audio Production
C#
#singing-synthesis#vocal-synthesis#speech-synthesis

Stay in the loop

Get weekly updates on trending AI coding tools and projects.