Showing 1-20 of 49 projects
Comprehensive Chinese NLP resource collection for developers
WhisperX for fast ASR with word-level timestamps and diarization
A scalable generative AI framework for researchers and developers
An open-source speech recognition toolkit used for building speech recognition systems.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
A multilingual voice understanding model for AI-powered audio analysis and transcription.
An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.
Fast and accurate automatic speech recognition (ASR) for edge devices
A Python API to get YouTube video transcripts without an API key or headless browser
HTML5 JavaScript recording library that supports multiple audio formats and provides features like ASR and DTMF.
Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper
A large language model-powered virtual salesperson that can generate product descriptions to drive user purchases.
A media player for language learning with AI-powered features like dual subtitles, real-time translation, and more.
A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.
Get weekly updates on trending AI coding tools and projects.