Explore Projects

Discover 69 open-source projects

Active filters (1):
Search: speech-to-text

Showing 1-20 of 69 projects

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K
Active
C++
Inference
CLI Tools
#speech-to-text#c++#inference

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K
Experimental
C++
AI Voice & Speech
Documentation
TensorFlow
#speech-to-text#deep-learning#tensorflow

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text#inference#quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
#asr#speech-to-text#diarization

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K
Active
TypeScript
AI Voice & Speech
Node
#open-source#virtual-assistant#speech-to-text

cjpais/Handy

A free, open-source, and extensible speech-to-text application that works offline.

16.9K
Active
TypeScript
React
#speech-to-text#accessibility#offline

jianchang512/pyvideotrans

A Python library that translates videos from one language to another, with support for dubbing and subtitles.

16.4K
Active
Python
AI Voice & Speech
#speech-to-text#text-to-speech#video-translation

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition#speaker-identification#speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition#voice-recognition#offline

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with fast transcription, speaker diarization, and summarization, built with Rust.

10.2K
Active
Rust
LLM Frameworks
#ai-meeting-assistant#transcription#speaker-diarization

kyutai-labs/moshi

Moshi is an open-source speech-text foundation model and full-duplex spoken dialogue framework for building AI-powered voice apps.

9.8K
Active
Python
LLM Frameworks
#speech-to-text#dialogue-framework#audio-codec

QuentinFuxa/WhisperLiveKit

Simultaneous speech-to-text built on Whisper for real-time transcription.

9.8K
Active
Python
AI Voice & Speech
#speech-to-text#transcription#whisper

KoljaB/RealtimeSTT

A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.

9.5K
Experimental
Python
AI Voice & Speech
#realtime#speech-to-text#voice-activity-detection

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K
Active
Python
AI Voice & Speech
#speech-recognition#speech-to-text#audio

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K
Stable
Python
AI Voice & Speech
API Frameworks
TensorFlow
#speech-recognition#speech-to-text#chinese

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-recognition#speech-emotion-recognition#audio-event-classification

TalAter/annyang

Speech recognition library for your web application, enabling voice interactions.

6.7K
Archived
JavaScript
Component Libraries (React)
AI Voice & Speech
React
#speech#speech-recognition#voice

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis
