Explore Projects

Discover 124 open source projects

Active filters (1):
Search: text-to-speechร—
Clear all

Showing 21-40 of 124 projects

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation#speech-synthesis#text-to-speech

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K
Active
Python
LLM Frameworks
React
#LLM#TTS#VITS2

jasonppy/VoiceCraft

A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.

8.5K
Experimental
Jupyter Notebook
AI Voice & Speech
Notebooks
#zero-shot#speech-editing#text-to-speech

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech#multi-speaker#emotion

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K
Archived
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#emotional-speech#text-to-speech#transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#deep-learning#speech-synthesis

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K
Stable
Python
AI Code Editors
React
#ChatTTS#tts#AI Coding Tools

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library supporting English, Spanish, French, Chinese, Japanese and Korean.

7.2K
Archived
Python
AI Voice & Speech
Backend Frameworks
Python
#text-to-speech#multilingual#audio-generation

Zyphra/Zonos

Zonos is an open-source, high-quality text-to-speech model for developers building AI-powered applications.

7.2K
Experimental
Python
AI Voice & Speech
BaaS Platforms
Python
#text-to-speech#multilingual#open-source

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#voice-cloning#speech-synthesis

canopyai/Orpheus-TTS

Orpheus-TTS is a high-quality, real-time text-to-speech library for creating human-sounding AI voices.

6.0K
Stable
Python
AI Voice & Speech
Python
#llm#realtime#tts

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K
Active
Jupyter Notebook
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#pre-trained-models

huggingface/parler-tts

Inference and training library for high-quality text-to-speech (TTS) models.

5.5K
Archived
Python
AI Voice & Speech
API Frameworks
Python
#text-to-speech#tts#speech-synthesis

promptslab/Awesome-Prompt-Engineering

This repository provides a curated collection of resources for Prompt Engineering with a focus on large language models like ChatGPT and GPT-3.

5.5K
Stable
Python
LLM Frameworks
Prompt Engineering
Python
#chatgpt#gpt-3#prompt-engineering

NVIDIA/tacotron2

A PyTorch implementation of Tacotron 2, a state-of-the-art text-to-speech model, with faster-than-realtime inference.

5.3K
Archived
Jupyter Notebook
Speech & Audio
Inference
PyTorch
#text-to-speech#audio-generation#machine-learning

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#singing-synthesis#text-to-speech#diffusion-model

gradio-app/fastrtc

A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.

4.5K
Active
JavaScript
AI Voice & Speech
Realtime
Python
#real-time#speech-to-text#text-to-speech

remsky/Kokoro-FastAPI

A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU and GPU support.

4.5K
Active
Python
AI Voice & Speech
FastAPI
#tts-api#fastapi#onnx

Stay in the loop

Get weekly updates on trending AI coding tools and projects.