Showing 1-20 of 69 projects
High-performance C/C++ port of OpenAI's Whisper for speech recognition
Offline speech-to-text engine for real-time on-device use
Faster Whisper transcription with CTranslate2 for efficient speech-to-text
WhisperX for fast ASR with word-level timestamps and diarization
An open-source personal assistant that provides AI-powered voice and text interactions.
A free, open-source, and extensible speech-to-text application that works offline.
A Python library that translates videos from one language to another, with support for dubbing and subtitles.
An open-source speech recognition toolkit used for building speech recognition systems.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.
A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.
Moshi is an open-source speech-to-text foundation model and dialogue framework for building AI-powered voice apps.
A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.
A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.
Python speech recognition library supporting multiple engines and APIs, both online and offline.
A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.
A multilingual voice understanding model for AI-powered audio analysis and transcription.
Speech recognition library for your web application, enabling voice interactions.
A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
Get weekly updates on trending AI coding tools and projects.