Explore Projects

Discover 84 open source projects

Active filters (1):
Search: speech-recognition

Showing 1-20 of 84 projects

huggingface/transformers

Model-definition framework for state-of-the-art ML models across text, vision, audio, and multimodal tasks.

157.4K
Active
Python
LLM Frameworks
Agents & Orchestration
PyTorch
#transformers #huggingface #deep-learning

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K
Active
C++
Inference
CLI Tools
#speech-to-text #c++ #inference

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K
Experimental
C++
AI Voice & Speech
Documentation
TensorFlow
#speech-to-text #deep-learning #tensorflow

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text #inference #quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
Python
#asr #speech-to-text #diarization

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source #virtual-assistant #speech-to-text

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition #speaker-identification #speaker-verification

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition #voice-activity-detection #audio-visual-speech-recognition

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K
Archived
Jupyter Notebook
ML Ops
PyTorch
#deep-learning #computer-vision #natural-language-processing

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition #voice-recognition #offline

kmario23/deep-learning-drizzle

A comprehensive collection of deep learning, reinforcement learning, and machine learning resources.

12.8K
Archived
HTML
Machine Learning
#deep-learning #machine-learning #computer-vision

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition #speech-synthesis #speech-translation

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition #audio-processing #deep-learning

openvinotoolkit/openvino

OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.

9.8K
Active
C++
Inference
#ai #computer-vision #deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition #speech-synthesis #speech-translation

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K
Active
Python
AI Voice & Speech
Python
#speech-recognition #speech-to-text #audio

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K
Stable
Python
AI Voice & Speech
API Frameworks
TensorFlow
#speech-recognition #speech-to-text #chinese

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-recognition #speech-emotion-recognition #audio-event-classification

TalAter/annyang

Speech recognition library for your web application, enabling voice interactions.

6.7K
Archived
JavaScript
Component Libraries (React)
AI Voice & Speech
React
#speech #speech-recognition #voice

flashlight/wav2letter

Facebook AI Research's end-to-end speech recognition toolkit written in C++.

6.4K
Active
C++
Speech Recognition
#speech-recognition #deep-learning #end-to-end
