Explore Projects

Discover 17 open source projects

Search filter: speech-processing

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition #audio-processing #deep-learning

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition #speaker-diarization #audio-processing

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K

Stable

Python

AI Voice & Speech

PyTorch

#speech-processing #voice-activity-detection #voice-commands

pliang279/awesome-multimodal-ml

A comprehensive reading list for research topics in multimodal machine learning.

6.8K

Archived

Computer Vision

Natural Language Processing

#multimodal-learning #reading-list #machine-learning

microsoft/torchscale

Foundation architecture for (M)LLMs: a toolkit for building large language models.

3.1K

Archived

Python

LLM Frameworks

API Frameworks

Python

#large-language-models #transformer #multimodal

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#speech-recognition #multilingual #transformers

r9y9/wavenet_vocoder

A high-quality open-source PyTorch implementation of the WaveNet vocoder, a neural network for speech synthesis.

2.4K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-synthesis #neural-vocoder #wavenet

resemble-ai/resemble-enhance

An AI-powered speech denoising and enhancement library for improving audio quality.

2.2K

Archived

Python

AI Voice & Speech

Python

#denoise #speech-denoising #speech-enhancement

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #speech-synthesis #deep-learning

TEN-framework/ten-vad

A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.

2.0K

Stable

AI Voice & Speech

API Frameworks

#audio #speech-processing #real-time

r9y9/deepvoice3_pytorch

A PyTorch implementation of text-to-speech synthesis models based on convolutional neural networks.

2.0K

Archived

Python

Speech-to-Text

Speech-Synthesis

PyTorch

#speech-processing #text-to-speech #multi-speaker

wq2012/awesome-diarization

A curated list of resources for speaker diarization, the speech processing task of identifying who spoke when.

1.9K

Experimental

Speech Processing

Awesome Lists

#speaker-diarization #speech-recognition #machine-learning

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition #speech-synthesis #speech-processing

haoheliu/voicefixer

A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.

1.3K

Experimental

Python

AI Voice & Speech

Signal Processing

Python

#speech-enhancement #audio-processing #signal-processing

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition #speech-translation #speech-synthesis

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples in speech and audio tasks.

1.2K

Archived

Python

Audio & Speech

Signal Processing

PyTorch

#audio-processing #speech-recognition #signal-processing

midas-research/audino

Open-source audio annotation tool for machine learning and speech processing datasets.

1.1K

Stable

TypeScript

Speech Processing

Datasets

TypeScript

#audio-annotation #speech-processing #machine-learning
