Explore Projects

Discover 188 open source projects

Active filters (1):
Search: synthesis

Showing 1-20 of 188 projects

coqui-ai/TTS

๐ŸธTTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K
Archived
Python
AI Voice & Speech
PyTorch
#text-to-speech #deep-learning #speech-synthesis

Stability-AI/generative-models

Stability AI's advanced 4D video generation model for high-fidelity novel-view synthesis

27.0K
Stable
Python
Computer Vision
AI Image & Video
#generative-models #video-generation #4d-synthesis

microsoft/VibeVoice

Open-source voice AI models for speech synthesis and recognition

23.6K
Active
Python
AI Voice & Speech
#voice-ai #speech-synthesis #speech-recognition

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source #virtual-assistant #speech-to-text

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K
Active
Python
React
#generative-ai #machine-learning #neural-networks

HumanAIGC/AnimateAnyone

A library for consistently and controllably animating images into videos of characters.

14.8K
Stable
Computer Vision
#image-to-video #character-animation #computer-vision

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K
Archived
Jupyter Notebook
ML Ops
PyTorch
#deep-learning #computer-vision #natural-language-processing

Tonejs/Tone.js

Tone.js is a Web Audio framework for building interactive music and audio applications in the browser.

14.7K
Active
TypeScript
Animation & Motion
JavaScript
#web-audio #music #samples

CompVis/latent-diffusion

A high-performance latent diffusion model for generating high-resolution images.

13.9K
Archived
Jupyter Notebook
Computer Vision
#computer-vision #generative-ai #diffusion-models

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition #speech-synthesis #speech-translation

duixcom/Duix-Avatar

An open-source AI avatar toolkit for offline video generation and digital human cloning.

12.4K
Stable
C
AI Image & Video
#ai-avatar #ai-avatars #cloning

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K
Active
TypeScript
Voice AI & Synthesis
Whisper
#qwen3-tts #voice-ai #mlx

sonic-pi-net/sonic-pi

Sonic Pi is a live coding environment for creating music and sound using Ruby.

11.7K
Stable
C++
Live Coding
#music #audio #live-coding

AudioKit/AudioKit

AudioKit is an audio synthesis, processing, and analysis platform for iOS, macOS, and tvOS.

11.3K
Stable
Swift
#audio #music #synthesis

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K
Stable
C++
AI Voice & Speech
#text-to-speech #tts #speech-synthesis

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K
Stable
Python
AI Voice & Speech
#text-to-speech #speech-synthesis #microsoft-edge

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition #speech-synthesis #speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation #speech-synthesis #text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K
Active
Python
AI Voice & Speech
PyTorch
#voice-conversion #speech-synthesis #realtime

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K
Active
Python
LLM Frameworks
React
#LLM #TTS #VITS2