Explore Projects

Discover 13 open source projects

Search: speaker-diarization

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition #audio-processing #deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition #speech-synthesis #speech-translation

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition #speaker-diarization #audio-processing

MahmoudAshraf97/whisper-diarization

Automatic speech recognition with speaker diarization using OpenAI Whisper.

5.4K

Stable

Jupyter Notebook

React

#asr #speaker-diarization #speech-recognition

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text and diarization, with no Python setup required.

2.9K

Stable

Desktop Model Runners

AI Voice & Speech

Whisper

#speech-to-text #whisper #faster-whisper

modelscope/3D-Speaker

A library for single- and multi-modal speaker verification, recognition, and diarization.

2.8K

Stable

Python

Computer Vision

AI Voice & Speech

Python

#speaker-verification #speaker-recognition #speaker-diarization

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence scores.

2.8K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#speech-recognition #multilingual #transformers

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K

Experimental

Python

AI Voice & Speech

API Frameworks

#real-time #speaker-diarization #speaker-embedding

wq2012/awesome-diarization

A curated list of resources for speaker diarization, the speech processing task of identifying who spoke when.

1.9K

Experimental

Speech Processing

Awesome Lists

#speaker-diarization #speech-recognition #machine-learning

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K

Active

Swift

AI Voice & Speech

iOS

Swift

#text-to-speech #speech-to-text #voice-activity-detection

google/uis-rnn

A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.

1.6K

Archived

Python

LLM Frameworks

API Frameworks

Python

#clustering #machine-learning #speaker-diarization

wenet-e2e/wespeaker

A research- and production-oriented toolkit for speaker verification, recognition, and diarization.

1.2K

Active

Python

Speech & Voice

API Frameworks

PyTorch

#speech-recognition #speaker-verification #speaker-diarization
