Explore Projects

Discover 188 open source projects

Active filters (1):

Search: synthesis×

Showing 1-20 of 188 projects

coqui-ai/TTS

🐸TTS is a deep learning toolkit for advanced Text-to-Speech generation with 1100+ languages and tools for training and fine-tuning models.

44.7K

Archived

Python

AI Voice & Speech

PyTorch

#text-to-speech#deep-learning#speech-synthesis

Stability-AI/generative-models

Stability AI's advanced 4D video generation model for high-fidelity novel-view synthesis

27.0K

Stable

Python

Computer Vision

AI Image & Video

#generative-models#video-generation#4d-synthesis

microsoft/VibeVoice

Open-source voice AI models for speech synthesis and recognition

23.6K

Active

Python

AI Voice & Speech

#voice-ai#speech-synthesis#speech-recognition

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K

Active

TypeScript

AI Voice & Speech

Node

#open-source#virtual-assistant#speech-to-text

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K

Active

Python

React

#generative-ai#machine-learning#neural-networks

HumanAIGC/AnimateAnyone

A library for consistently and controllably animating images into videos of characters.

14.8K

Stable

Computer Vision

#image-to-video#character-animation#computer-vision

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K

Archived

Jupyter Notebook

ML Ops

PyTorch

#deep-learning#computer-vision#natural-language-processing

Tonejs/Tone.js

Tone.js is a Web Audio framework for building interactive music and audio applications in the browser.

14.7K

Active

TypeScript

Animation & Motion

JavaScript

#web-audio#music#samples

CompVis/latent-diffusion

A high-performance latent diffusion model for generating high-resolution images.

13.9K

Archived

Jupyter Notebook

Computer Vision

#computer-vision#generative-ai#diffusion-models

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

duixcom/Duix-Avatar

An open-source AI avatar toolkit for offline video generation and digital human cloning.

12.4K

Stable

AI Image & Video

#ai-avatar#ai-avatars#cloning

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K

Active

TypeScript

Voice AI & Synthesis

Whisper

#qwen3-tts#voice-ai#mlx

sonic-pi-net/sonic-pi

Sonic Pi is a live coding environment for creating music and sound using Ruby.

11.7K

Stable

C++

Live Coding

#music#audio#live-coding

AudioKit/AudioKit

AudioKit is an audio synthesis, processing, and analysis platform for iOS, macOS, and tvOS

11.3K

Stable

Swift

#audio#music#synthesis

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K

Stable

C++

AI Voice & Speech

#text-to-speech#tts#speech-synthesis

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K

Stable

Python

AI Voice & Speech

#text-to-speech#speech-synthesis#microsoft-edge

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K

Experimental

Python

AI Audio & Speech

Python

#audio-generation#speech-synthesis#text-to-speech

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K

Active

Python

AI Voice & Speech

PyTorch

#voice-conversion#speech-synthesis#realtime

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K

Active

Python

LLM Frameworks

React

#LLM#TTS#VITS2

2...10

Stay in the loop

Get weekly updates on trending AI coding tools and projects.