A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
A real-time microphone noise suppression tool for Linux developers, built with Go.
A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.
Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.
Automagically synchronize subtitles with video using audio alignment and speech detection.
A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.
A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.
A TypeScript voice activity detector (VAD) library with a simple API for the browser.
Real-time speech recognition and voice activity detection for offline use on multiple platforms.
An open-source AI assistant that works seamlessly in meetings, interviews, and conversations without detection.
Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.