Explore Projects

Discover 19 open source projects

Active filters (1):
Search: diarizationร—
Clear all

Showing 1-19 of 19 projects

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
Python
#asr#speech-to-text#diarization

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K
Active
Rust
LLM Frameworks
#ai-meeting-assistant#transcription#speaker-diarization

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

tmoroney/auto-subs

AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.

2.9K
Active
TypeScript
AI Voice & Speech
Desktop Model Runners
OpenAI
#ai-subtitles#davinci-resolve#speaker-diarization

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text#whisper#faster-whisper

modelscope/3D-Speaker

A library for single- and multi-modal speaker verification, recognition, and diarization.

2.8K
Stable
Python
Computer Vision
AI Voice & Speech
Python
#speaker-verification#speaker-recognition#speaker-diarization

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K
Experimental
Python
AI Voice & Speech
API Frameworks
#real-time#speaker-diarization#speaker-embedding

wq2012/awesome-diarization

A curated list of resources for speaker diarization, a speech processing task to identify who spoke when.

1.9K
Experimental
Speech Processing
Awesome Lists
#speaker-diarization#speech-recognition#machine-learning

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

google/uis-rnn

A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.

1.6K
Archived
Python
LLM Frameworks
API Frameworks
Python
#clustering#machine-learning#speaker-diarization

R3gm/SoniTranslate

Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.

1.3K
Stable
Python
AI Voice & Speech
CMS & Content
#video-dubbing#speech-to-text#text-to-speech

wenet-e2e/wespeaker

A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.

1.2K
Active
Python
Speech & Voice
API Frameworks
PyTorch
#speech-recognition#speaker-verification#speaker-diarization

JuergenFleiss/aTrain

A Python GUI tool for offline transcription of speech recordings, including speaker diarization, using state-of-the-art machine learning models.

1.1K
Active
Python
AI Voice & Speech
CLI Tools
#speech-recognition#transcription#diarization

Stay in the loop

Get weekly updates on trending AI coding tools and projects.