Explore Projects

Discover 188 open source projects

Active filters (1):

Search: synthesis×

Clear all

Showing 21-40 of 188 projects

fudan-generative-vision/hallo

Generates hierarchical audio-driven visual synthesis for portrait image animation

8.6K

Archived

Python

React

#animation#face-animation#image-animation

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K

Archived

Python

AI Voice & Speech

Machine Learning Ops

PyTorch

#text-to-speech#multi-speaker#emotion

LiheYoung/Depth-Anything

A foundation model for monocular depth estimation that leverages large-scale unlabeled data.

8.0K

Archived

Python

Computer Vision

Python

#depth-estimation#image-synthesis#metric-depth-estimation

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K

Archived

Python

LLM Wrappers & SDKs

AI Voice & Speech

Python

#emotional-speech#text-to-speech#transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#deep-learning#speech-synthesis

apple/ml-sharp

A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.

7.8K

Stable

Python

Computer Vision

#computer-vision#3d-reconstruction#view-synthesis

NVlabs/SPADE

SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.

7.7K

Archived

Python

Computer Vision

Animation & Motion

PyTorch

#computer-vision#image-synthesis#semantic-segmentation

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K

Stable

Python

AI Code Editors

React

#ChatTTS#tts#AI Coding Tools

open-mmlab/mmagic

An open-source, multi-purpose AI creation toolbox for text-to-image, image/video processing, and more.

7.4K

Archived

Jupyter Notebook

Computer Vision

Generative AI

PyTorch

#text-to-image#image-generation#image-processing

vt-vl-lab/3d-photo-inpainting

A Python library for 3D photography using context-aware layered depth inpainting.

7.1K

Archived

Python

Computer Vision

#3d-photo#novel-view-synthesis#computer-vision

openai/point-e

A Python library for generating 3D models from point clouds using diffusion models.

6.9K

Archived

Python

Computer Vision

ML Ops

Python

#point-cloud#3d-generation#diffusion-models

supercollider/supercollider

An open-source audio programming environment for sound synthesis and algorithmic composition.

6.4K

Active

C++

Music

IDE Extensions

#audio#synthesis#music

CompVis/taming-transformers

A library for high-resolution image synthesis using transformers, focused on computer vision applications.

6.4K

Archived

Jupyter Notebook

Computer Vision

API Frameworks

Jupyter Notebook

#computer-vision#image-synthesis#transformers

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon#speech-recognition#speech-synthesis

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#voice-cloning#speech-synthesis

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K

Active

Jupyter Notebook

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#speech-synthesis#pre-trained-models

RodZill4/material-maker

A procedural textures authoring and 3D model painting tool based on the Godot game engine.

5.2K

Active

GDScript

Component Libraries (React)

Full-Stack Frameworks

Godot

#godot-engine#material-maker#procedural-generation

salesforce/CodeGen

CodeGen is an open-source family of models for program synthesis, competitive with OpenAI Codex.

5.2K

Stable

Python

LLM Frameworks

AI Code Generation

Python

#codex#generative-model#language-model

NVlabs/Sana

SANA is an efficient high-resolution image synthesis library using a linear diffusion transformer.

5.0K

Active

Python

Computer Vision

Inference

PyTorch

#diffusion#text-to-image-generation#transformers

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#singing-synthesis#text-to-speech#diffusion-model

13...10

Stay in the loop

Get weekly updates on trending AI coding tools and projects.