Explore Projects

Discover 188 open source projects

Active filters (1):
Search: synthesisร—
Clear all

Showing 21-40 of 188 projects

fudan-generative-vision/hallo

Generates hierarchical audio-driven visual synthesis for portrait image animation

8.6K
Archived
Python
React
#animation#face-animation#image-animation

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech#multi-speaker#emotion

LiheYoung/Depth-Anything

A foundation model for monocular depth estimation that leverages large-scale unlabeled data.

8.0K
Archived
Python
Computer Vision
Python
#depth-estimation#image-synthesis#metric-depth-estimation

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K
Archived
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#emotional-speech#text-to-speech#transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#deep-learning#speech-synthesis

apple/ml-sharp

A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.

7.8K
Stable
Python
Computer Vision
#computer-vision#3d-reconstruction#view-synthesis

NVlabs/SPADE

SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.

7.7K
Archived
Python
Computer Vision
Animation & Motion
PyTorch
#computer-vision#image-synthesis#semantic-segmentation

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K
Stable
Python
AI Code Editors
React
#ChatTTS#tts#AI Coding Tools

open-mmlab/mmagic

An open-source, multi-purpose AI creation toolbox for text-to-image, image/video processing, and more.

7.4K
Archived
Jupyter Notebook
Computer Vision
Generative AI
PyTorch
#text-to-image#image-generation#image-processing

vt-vl-lab/3d-photo-inpainting

A Python library for 3D photography using context-aware layered depth inpainting.

7.1K
Archived
Python
Computer Vision
Computer Vision
#3d-photo#novel-view-synthesis#computer-vision

openai/point-e

A Python library for generating 3D models from point clouds using diffusion models.

6.9K
Archived
Python
Computer Vision
ML Ops
Python
#point-cloud#3d-generation#diffusion-models

supercollider/supercollider

An open-source audio programming environment for sound synthesis and algorithmic composition.

6.4K
Active
C++
Music
IDE Extensions
#audio#synthesis#music

CompVis/taming-transformers

A library for high-resolution image synthesis using transformers, focused on computer vision applications.

6.4K
Archived
Jupyter Notebook
Computer Vision
API Frameworks
Jupyter Notebook
#computer-vision#image-synthesis#transformers

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#voice-cloning#speech-synthesis

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K
Active
Jupyter Notebook
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#pre-trained-models

RodZill4/material-maker

A procedural textures authoring and 3D model painting tool based on the Godot game engine.

5.2K
Active
GDScript
Component Libraries (React)
Full-Stack Frameworks
Godot
#godot-engine#material-maker#procedural-generation

salesforce/CodeGen

CodeGen is an open-source family of models for program synthesis, competitive with OpenAI Codex.

5.2K
Stable
Python
LLM Frameworks
AI Code Generation
Python
#codex#generative-model#language-model

NVlabs/Sana

SANA is an efficient high-resolution image synthesis library using a linear diffusion transformer.

5.0K
Active
Python
Computer Vision
Inference
PyTorch
#diffusion#text-to-image-generation#transformers

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#singing-synthesis#text-to-speech#diffusion-model
13...10

Stay in the loop

Get weekly updates on trending AI coding tools and projects.