Explore Projects

Discover 23 open source projects

Active filters (1):

Search: voice-cloning×

Clear all

Showing 1-20 of 23 projects

CorentinJ/Real-Time-Voice-Cloning

Real-time voice cloning using deep learning

59.5K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#voice-cloning#tts#deep-learning

RVC-Boss/GPT-SoVITS

Few-shot voice cloning and TTS with 1 min training data

55.5K

Active

Python

AI Voice & Speech

#tts#voice-clone#few-shot

unslothai/unsloth

Fine-tuning & RL for LLMs with optimized performance and memory use

53.4K

Active

Python

Fine-tuning

#llm#fine-tuning#reinforcement-learning

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K

Archived

Python

AI Voice & Speech

PyTorch

#text-to-speech#deep-learning#speech-synthesis

myshell-ai/OpenVoice

Instant voice cloning model with tone color cloning and multi-lingual support

36.0K

Experimental

Python

AI Voice & Speech

SaaS Boilerplates

Python

#voice-clone#text-to-speech#zero-shot-tts

FunAudioLLM/CosyVoice

Multilingual voice generation model with full-stack capabilities for TTS, training, and deployment

19.8K

Active

Python

AI Voice & Speech

Fine-tuning

PyTorch

#tts#voice-generation#multilingual

index-tts/index-tts

An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.

19.1K

Stable

Python

AI Voice & Speech

Python

#text-to-speech#zero-shot#voice-cloning

DrewThomasson/ebook2audiobook

Converts e-books to audiobooks using AI voice cloning and supports over 1158 languages.

18.4K

Active

Python

React

#audiobook#voice-cloning#tts

Huanshere/VideoLingo

Fully automated AI video subtitle team with one-click subtitle cutting, translation, alignment, and dubbing.

16.1K

Experimental

Python

AI Video & Image

Python

#ai-translation#dubbing#localization

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K

Active

TypeScript

Voice AI & Synthesis

Whisper

#qwen3-tts#voice-ai#mlx

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K

Archived

Python

LLM Wrappers & SDKs

AI Voice & Speech

Python

#emotional-speech#text-to-speech#transformer-architecture

multimodal-art-projection/YuE

Open-source full-song music generation foundation model for developers building AI-powered audio applications.

6.1K

Experimental

Python

LLM Frameworks

Audio Generation

PyTorch

#music-generation#audio-generation#deep-learning

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#voice-cloning#speech-synthesis

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-to-speech#text-to-speech#voice-conversion

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1.4K

Archived

Python

Prompt Engineering

React

#text-to-speech#tts#voice-cloning

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K

Active

Python

AI Voice & Speech

CLI Tools

Python

#gpt-sovits#text-to-speech#tts

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition#speech-synthesis#speech-processing

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K

Stable

Python

AI Voice & Speech

ComfyUI Custom Nodes

React

#text-to-speech#voice-cloning#ComfyUI

MiniMax-AI/MiniMax-MCP

Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.

1.3K

Active

Python

MCP Servers

AI Image & Video

Python

#mcp#text-to-speech#image-generation

Stay in the loop

Get weekly updates on trending AI coding tools and projects.