Explore Projects

Discover 124 open source projects

Active filters (1):

Search: text-to-speech×

Clear all

Showing 21-40 of 124 projects

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K

Experimental

Python

AI Audio & Speech

Python

#audio-generation#speech-synthesis#text-to-speech

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K

Active

Python

LLM Frameworks

React

#LLM#TTS#VITS2

jasonppy/VoiceCraft

A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.

8.5K

Experimental

Jupyter Notebook

AI Voice & Speech

Notebooks

#zero-shot#speech-editing#text-to-speech

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K

Archived

Python

AI Voice & Speech

Machine Learning Ops

PyTorch

#text-to-speech#multi-speaker#emotion

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K

Archived

Python

LLM Wrappers & SDKs

AI Voice & Speech

Python

#emotional-speech#text-to-speech#transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#deep-learning#speech-synthesis

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K

Stable

Python

AI Code Editors

React

#ChatTTS#tts#AI Coding Tools

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library supporting English, Spanish, French, Chinese, Japanese and Korean.

7.2K

Archived

Python

AI Voice & Speech

Backend Frameworks

Python

#text-to-speech#multilingual#audio-generation

Zyphra/Zonos

Zonos is an open-source, high-quality text-to-speech model for developers building AI-powered applications.

7.2K

Experimental

Python

AI Voice & Speech

BaaS Platforms

Python

#text-to-speech#multilingual#open-source

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon#speech-recognition#speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#voice-cloning#speech-synthesis

canopyai/Orpheus-TTS

Orpheus-TTS is a high-quality, real-time text-to-speech library for creating human-sounding AI voices.

6.0K

Stable

Python

AI Voice & Speech

Python

#llm#realtime#tts

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K

Active

Jupyter Notebook

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#speech-synthesis#pre-trained-models

huggingface/parler-tts

Inference and training library for high-quality text-to-speech (TTS) models.

5.5K

Archived

Python

AI Voice & Speech

API Frameworks

Python

#text-to-speech#tts#speech-synthesis

promptslab/Awesome-Prompt-Engineering

This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.

5.5K

Stable

Python

LLM Frameworks

Prompt Engineering

Python

#chatgpt#gpt-3#prompt-engineering

NVIDIA/tacotron2

A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.

5.3K

Archived

Jupyter Notebook

Speech & Audio

Inference

PyTorch

#text-to-speech#audio-generation#machine-learning

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#singing-synthesis#text-to-speech#diffusion-model

gradio-app/fastrtc

A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.

4.5K

Active

JavaScript

AI Voice & Speech

Realtime

Python

#real-time#speech-to-text#text-to-speech

remsky/Kokoro-FastAPI

A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU and GPU support.

4.5K

Active

Python

AI Voice & Speech

FastAPI

#tts-api#fastapi#onnx

13 4 5 6 7

Stay in the loop

Get weekly updates on trending AI coding tools and projects.