A zero-shot multi-speaker text-to-speech (TTS) and voice conversion library for developers.
An open-source AI-powered virtual YouTuber (VTuber) platform built with Python for streaming on YouTube and Twitch.
A GAN-based mel-spectrogram inversion network for high-quality text-to-speech synthesis.
A small, fast, portable speech synthesis system written in C.
A Flash + AIR sound effects generator based on Sfxr, used for game development.
Official implementation of a fashion image-to-video synthesis model using Stable Diffusion.
A PyTorch-based framework for non-autoregressive text-to-speech synthesis, including PortaSpeech and DiffSpeech models.
Sample code for Microsoft's Cognitive Speech-to-Text API, featuring custom neural voices and text-to-speech functionality.