Explore Projects

Discover 10 open source projects

Active filters (1):
Search: vad

Showing 1-10 of 10 projects

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text #text-to-speech #offline

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K
Stable
Python
AI Voice & Speech
PyTorch
#speech-processing #voice-activity-detection #voice-commands

smacke/ffsubsync

Automagically synchronize subtitles with video using audio alignment and speech detection.

7.6K
Stable
Python
AI Audio & Speech
API Frameworks
#audio-alignment #speech-detection #subtitle-synchronization

CheshireCC/faster-whisper-GUI

A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.

2.9K
Archived
Python
AI Voice & Speech
AI App Builders
PySide6
#speech-transcription #openai-whisper #voice-activity-detection

TEN-framework/ten-vad

A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.

2.0K
Stable
C
AI Voice & Speech
API Frameworks
#audio #speech-processing #real-time

ricky0123/vad

A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.

1.9K
Active
TypeScript
AI Voice & Speech
Frontend Frameworks
#speech-to-text #voice-activity-detection #web-audio-api

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K
Stable
C++
AI Voice & Speech
Cross-Platform
#speech-recognition #voice-activity-detection #offline

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
#text-to-speech #speech-to-text #voice-activity-detection

hustvl/VAD

A Python library for efficient autonomous driving using vectorized scene representation.

1.2K
Active
Python
Computer Vision
API Frameworks
#autonomous-driving #computer-vision #efficient-algorithms
