Explore Projects

Discover 51 open source projects

Active filters (1):

Search: speech-synthesis

Showing 1-20 of 51 projects

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K

Archived

Python

AI Voice & Speech

PyTorch

#text-to-speech #deep-learning #speech-synthesis

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K

Active

TypeScript

AI Voice & Speech

Node

#open-source #virtual-assistant #speech-to-text

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K

Active

Python

PyTorch

#generative-ai #machine-learning #neural-networks

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K

Archived

Jupyter Notebook

ML Ops

PyTorch

#deep-learning #computer-vision #natural-language-processing

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition #speech-synthesis #speech-translation

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K

Stable

C++

AI Voice & Speech

#text-to-speech #tts #speech-synthesis

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K

Stable

Python

AI Voice & Speech

#text-to-speech #speech-synthesis #microsoft-edge

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition #speech-synthesis #speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K

Experimental

Python

AI Audio & Speech

Python

#audio-generation #speech-synthesis #text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K

Active

Python

AI Voice & Speech

PyTorch

#voice-conversion #speech-synthesis #realtime

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K

Archived

Python

AI Voice & Speech

Machine Learning Ops

PyTorch

#text-to-speech #multi-speaker #emotion

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #deep-learning #speech-synthesis

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon #speech-recognition #speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #voice-cloning #speech-synthesis

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K

Active

Jupyter Notebook

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #speech-synthesis #pre-trained-models

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#singing-synthesis #text-to-speech #diffusion-model

huggingface/speech-to-speech

Open-source and modular AI-powered speech-to-speech translation tool built with Python.

4.5K

Experimental

Python

Speech & Voice

API Frameworks

Python

#speech-recognition #speech-synthesis #speech-translation

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K

Archived

Python

AI Voice & Speech

TensorFlow

#speech-synthesis #text-to-speech #tts

KoljaB/RealtimeTTS

A Python library that provides real-time text-to-speech conversion capabilities for developers.

3.8K

Active

Python

AI Voice & Speech

#realtime #speech-synthesis #text-to-speech

stakira/OpenUtau

An open-source successor to UTAU, a platform for singing voice synthesis and audio production.

3.6K

Active

AI Voice & Speech

Audio Production

#singing-synthesis #vocal-synthesis #speech-synthesis
