Explore Projects

Discover 12 open source projects

Active filters (1):
Search: voice-activity-detectionร—
Clear all

Showing 1-12 of 12 projects

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

noisetorch/NoiseTorch

A real-time microphone noise suppression tool for Linux developers, built with Go.

10.1K
Archived
Go
Noise Reduction
#linux#noise-reduction#noise-suppression

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K
Stable
Python
AI Voice & Speech
PyTorch
#speech-processing#voice-activity-detection#voice-commands

smacke/ffsubsync

Automagically synchronize subtitles with video using audio alignment and speech detection.

7.6K
Stable
Python
AI Audio & Speech
API Frameworks
#audio-alignment#speech-detection#subtitle-synchronization

TEN-framework/ten-vad

A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.

2.0K
Stable
C
AI Voice & Speech
API Frameworks
C
#audio#speech-processing#real-time

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K
Experimental
Python
AI Voice & Speech
API Frameworks
#real-time#speaker-diarization#speaker-embedding

ricky0123/vad

A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.

1.9K
Active
TypeScript
AI Voice & Speech
Frontend Frameworks
TypeScript
#speech-to-text#voice-activity-detection#web-audio-api

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K
Stable
C++
AI Voice & Speech
Cross-Platform
#speech-recognition#voice-activity-detection#offline

iamsrikanthnani/pluely

An open-source AI assistant that works seamlessly in meetings, interviews, and conversations without detection.

1.6K
Active
TypeScript
AI Assistants
Desktop Apps
React
#ai-assistant#undetectable#privacy-first

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.