Showing 21-40 of 368 projects
Multilingual voice generation model with full-stack capabilities for TTS, training, and deployment
An ultra-realistic text-to-speech model for generating natural-sounding dialogue and audio.
An efficient zero-shot text-to-speech system with fine-grained control over the generated voice.
A set of Jupyter Notebooks that combine Grounding DINO, Segment Anything, and Stable Diffusion for automatic detection, segmentation, and generation of anything in images.
An open-source personal assistant that provides AI-powered voice and text interactions.
A free, open-source, and extensible speech-to-text application that works offline.
A scalable generative AI framework for researchers and developers
A Python library that translates videos from one language to another, with support for dubbing and subtitles.
An open-source speech recognition toolkit used for building speech recognition systems.
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
A multi-voice text-to-speech (TTS) system with a focus on high-quality audio output.
A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.
A conversational speech generation model for developers building AI-powered applications.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Official code for a text-to-speech model that generates fluent and faithful speech with flow matching.
An Electron-based WeChat client for macOS and Linux, providing a better user experience.
A lip sync generation tool that leverages AI to synchronize speech with video in the wild.
A comprehensive collection of deep learning, reinforcement learning, and machine learning resources for vibe coders.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
Foundational models for state-of-the-art speech and text translation
Get weekly updates on trending AI coding tools and projects.