Hierarchical audio-driven visual synthesis for portrait image animation.
EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.
A foundation model for monocular depth estimation that leverages large-scale unlabeled data.
An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.
A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.
A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.
SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.
A simple native web interface for ChatTTS text-to-speech synthesis with API support.
An open-source, multi-purpose AI creation toolbox for text-to-image, image/video processing, and more.
A Python library for 3D photography using context-aware layered depth inpainting.
A Python library for generating 3D models from point clouds using diffusion models.
An open-source audio programming environment for sound synthesis and algorithmic composition.
A library for high-resolution image synthesis using transformers, focused on computer vision applications.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.
Pre-trained text-to-speech models for various languages, made simple to use.
A procedural textures authoring and 3D model painting tool based on the Godot game engine.
CodeGen is an open-source family of models for program synthesis, competitive with OpenAI Codex.
SANA is an efficient high-resolution image synthesis library using a linear diffusion transformer.
DiffSinger is a singing voice synthesis (SVS) system using a shallow diffusion mechanism, enabling efficient text-to-speech (TTS) and SVS.