Explore Projects

Discover 19 open source projects

Search: diarization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

Python

#asr #speech-to-text #diarization

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition #audio-processing #deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text #text-to-speech #offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K

Active

Rust

LLM Frameworks

#ai-meeting-assistant #transcription #speaker-diarization

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition #speech-synthesis #speech-translation

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition #speaker-diarization #audio-processing

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr #speaker-diarization #speech-recognition

tmoroney/auto-subs

AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.

2.9K

Active

TypeScript

AI Voice & Speech

Desktop Model Runners

OpenAI

#ai-subtitles #davinci-resolve #speaker-diarization

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K

Stable

Desktop Model Runners

AI Voice & Speech

Whisper

#speech-to-text #whisper #faster-whisper

modelscope/3D-Speaker

A library for single- and multi-modal speaker verification, recognition, and diarization.

2.8K

Stable

Python

Computer Vision

AI Voice & Speech

Python

#speaker-verification #speaker-recognition #speaker-diarization

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#speech-recognition #multilingual #transformers

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K

Experimental

Python

AI Voice & Speech

API Frameworks

#real-time #speaker-diarization #speaker-embedding

wq2012/awesome-diarization

A curated list of resources for speaker diarization, a speech processing task to identify who spoke when.

1.9K

Experimental

Speech Processing

Awesome Lists

#speaker-diarization #speech-recognition #machine-learning

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K

Active

Swift

AI Voice & Speech

iOS

Swift

#text-to-speech #speech-to-text #voice-activity-detection

google/uis-rnn

A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.

1.6K

Archived

Python

LLM Frameworks

API Frameworks

Python

#clustering #machine-learning #speaker-diarization

R3gm/SoniTranslate

Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.

1.3K

Stable

Python

AI Voice & Speech

CMS & Content

#video-dubbing #speech-to-text #text-to-speech

wenet-e2e/wespeaker

A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.

1.2K

Active

Python

Speech & Voice

API Frameworks

PyTorch

#speech-recognition #speaker-verification #speaker-diarization

JuergenFleiss/aTrain

A Python GUI tool for offline transcription of speech recordings, including speaker diarization, using state-of-the-art machine learning models.

1.1K

Active

Python

AI Voice & Speech

CLI Tools

#speech-recognition #transcription #diarization