A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.
A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.
Automatic speech recognition with speaker diarization using OpenAI Whisper.
Standalone Windows executables for Whisper speech-to-text and diarization without Python setup.
A library for single- and multi-modal speaker verification, recognition, and diarization.
An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.
A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.
A curated list of resources for speaker diarization, the speech processing task of identifying who spoke when.
Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.
A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.
A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.