Explore Projects

Discover 17 open source projects

Active filters (1):
Search: speech-processingร—
Clear all

Showing 1-17 of 17 projects

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K
Stable
Python
AI Voice & Speech
PyTorch
#speech-processing#voice-activity-detection#voice-commands

pliang279/awesome-multimodal-ml

A comprehensive reading list for research topics in multimodal machine learning.

6.8K
Archived
Computer Vision
Natural Language Processing
#multimodal-learning#reading-list#machine-learning

microsoft/torchscale

Foundation Architecture for (M)LLMs, a powerful toolkit for building large language models.

3.1K
Archived
Python
LLM Frameworks
API Frameworks
Python
#large-language-models#transformer#multimodal

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

r9y9/wavenet_vocoder

A high-quality open-source PyTorch implementation of the WaveNet vocoder, a neural network for speech synthesis.

2.4K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-synthesis#neural-vocoder#wavenet

resemble-ai/resemble-enhance

An AI-powered speech denoising and enhancement library for improving audio quality.

2.2K
Archived
Python
AI Voice & Speech
Python
#denoise#speech-denoising#speech-enhancement

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#deep-learning

TEN-framework/ten-vad

A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.

2.0K
Stable
C
AI Voice & Speech
API Frameworks
C
#audio#speech-processing#real-time

r9y9/deepvoice3_pytorch

A PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models.

2.0K
Archived
Python
Speech-to-Text
Speech-Synthesis
PyTorch
#speech-processing#text-to-speech#multi-speaker

wq2012/awesome-diarization

A curated list of resources for speaker diarization, a speech processing task to identify who spoke when.

1.9K
Experimental
Speech Processing
Awesome Lists
#speaker-diarization#speech-recognition#machine-learning

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

haoheliu/voicefixer

A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.

1.3K
Experimental
Python
AI Voice & Speech
Signal Processing
Python
#speech-enhancement#audio-processing#signal-processing

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-translation#speech-synthesis

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K
Archived
Python
Audio & Speech
Signal Processing
PyTorch
#audio-processing#speech-recognition#signal-processing

midas-research/audino

Open-source audio annotation tool for machine learning and speech processing datasets.

1.1K
Stable
TypeScript
Speech Processing
Datasets
TypeScript
#audio-annotation#speech-processing#machine-learning

Stay in the loop

Get weekly updates on trending AI coding tools and projects.