Explore Projects

Discover 13 open source projects

Active filters (1):
Search: speaker-diarizationร—
Clear all

Showing 1-13 of 13 projects

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text#whisper#faster-whisper

modelscope/3D-Speaker

A library for single- and multi-modal speaker verification, recognition, and diarization.

2.8K
Stable
Python
Computer Vision
AI Voice & Speech
Python
#speaker-verification#speaker-recognition#speaker-diarization

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K
Experimental
Python
AI Voice & Speech
API Frameworks
#real-time#speaker-diarization#speaker-embedding

wq2012/awesome-diarization

A curated list of resources for speaker diarization, a speech processing task to identify who spoke when.

1.9K
Experimental
Speech Processing
Awesome Lists
#speaker-diarization#speech-recognition#machine-learning

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

google/uis-rnn

A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.

1.6K
Archived
Python
LLM Frameworks
API Frameworks
Python
#clustering#machine-learning#speaker-diarization

wenet-e2e/wespeaker

A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.

1.2K
Active
Python
Speech & Voice
API Frameworks
PyTorch
#speech-recognition#speaker-verification#speaker-diarization

Stay in the loop

Get weekly updates on trending AI coding tools and projects.