Explore Projects

Discover 84 open source projects

Active filters (1):

Search: speech-recognition×

Clear all

Showing 1-20 of 84 projects

huggingface/transformers

Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.

157.4K

Active

Python

LLM Frameworks

Agents & Orchestration

PyTorch

#transformers#huggingface#deep-learning

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K

Active

C++

Inference

CLI Tools

#speech-to-text#c++#inference

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K

Experimental

C++

AI Voice & Speech

Documentation

TensorFlow

#speech-to-text#deep-learning#tensorflow

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K

Stable

Python

Inference

Local Inference Engines

CTranslate2

#speech-to-text#inference#quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

Python

#asr#speech-to-text#diarization

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K

Active

TypeScript

AI Voice & Speech

Node

#open-source#virtual-assistant#speech-to-text

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K

Stable

Shell

Speech Recognition

#speech-recognition#speaker-identification#speaker-verification

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

NVIDIA/DeepLearningExamples

A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.

14.7K

Archived

Jupyter Notebook

ML Ops

PyTorch

#deep-learning#computer-vision#natural-language-processing

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition#voice-recognition#offline

kmario23/deep-learning-drizzle

A comprehensive collection of deep learning, reinforcement learning, and machine learning resources for vibe coders.

12.8K

Archived

HTML

Machine Learning

#deep-learning#machine-learning#computer-vision

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition#audio-processing#deep-learning

openvinotoolkit/openvino

OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.

9.8K

Active

C++

Inference

#ai#computer-vision#deep-learning

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-to-text#audio

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K

Stable

Python

AI Voice & Speech

API Frameworks

TensorFlow

#speech-recognition#speech-to-text#chinese

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-recognition#speech-emotion-recognition#audio-event-classification

TalAter/annyang

Speech recognition library for your web application, enabling voice interactions.

6.7K

Archived

JavaScript

Component Libraries (React)

AI Voice & Speech

React

#speech#speech-recognition#voice

flashlight/wav2letter

Facebook AI Research's end-to-end speech recognition toolkit written in C++.

6.4K

Active

C++

Speech Recognition

#speech-recognition#deep-learning#end-to-end

2 3 4 5

Stay in the loop

Get weekly updates on trending AI coding tools and projects.