Showing 101-120 of 368 projects
DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.
Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.
A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.
A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.
A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU and GPU support.
Open-source and modular AI-powered speech-to-speech translation tool built with Python.
OptiKey is a C# library for full computer control and speech with your eyes.
A fast, multimodal LLM for real-time voice applications and AI-powered speech tools.
A small speech recognition library written in C that can be used in a variety of applications.
A voice-to-text app for macOS that transcribes speech to text almost instantly.
A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.
A Chinese NLP library for tokenization, part-of-speech tagging, named entity recognition, and lexical analysis.
An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.
A very low-bitrate speech codec for efficient audio compression, useful for various applications.
An end-to-end multimodal AI model that can understand and generate text, audio, vision, and video in real-time.
A deep learning-based noise suppression library for audio and speech enhancement applications.
A nearly-live implementation of OpenAI's Whisper, a powerful speech recognition and translation tool.
A Python library that provides real-time text-to-speech conversion capabilities for developers.
An open-source successor to UTAU, a platform for singing voice synthesis and audio production.
A curated collection of Python scripts from basics to advanced, including automation tasks.
Get weekly updates on trending AI coding tools and projects.