Showing 1-10 of 10 projects
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.
Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.
Automagically synchronize subtitles with video using audio alignment and speech detection.
A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.
A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.
A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.
Real-time speech recognition and voice activity detection for offline use on multiple platforms.
Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.
A Python library for efficient autonomous driving using vectorized scene representation.
Get weekly updates on trending AI coding tools and projects.