Showing 41-51 of 51 projects
A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.
A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
Official PyTorch implementation of BigVGAN, a neural vocoder for generating high-quality audio, music, and speech.
Offline Russian voice assistant with plugin-based skills for developers working with AI tools.
A PyTorch-based library for zero-shot voice style transfer using only autoencoder loss.
A zero-shot multi-speaker text-to-speech (TTS) and voice conversion library for developers.
An open-source AI-powered virtual YouTuber (VTuber) platform built with Python for streaming on YouTube and Twitch.
A GAN-based Mel-Spectrogram Inversion Network for high-quality text-to-speech synthesis.
A PyTorch-based framework for non-autoregressive text-to-speech synthesis, including PortaSpeech and DiffSpeech models.
Sample code for Microsoft's Cognitive Speech-to-Text API, featuring custom neural voices and text-to-speech functionality.
Get weekly updates on trending AI coding tools and projects.