Explore Projects

Discover 69 open source projects

Active filters (1):

Search: speech-to-text

Showing 1-20 of 69 projects

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K

Active

C++

Inference

CLI Tools

#speech-to-text #c++ #inference

mozilla/DeepSpeech

Offline speech-to-text engine for real-time on-device use

26.7K

Experimental

C++

AI Voice & Speech

Documentation

TensorFlow

#speech-to-text #deep-learning #tensorflow

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K

Stable

Python

Inference

Local Inference Engines

CTranslate2

#speech-to-text #inference #quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

Python

#asr #speech-to-text #diarization

leon-ai/leon

An open-source personal assistant that provides AI-powered voice and text interactions.

17.0K

Active

TypeScript

AI Voice & Speech

Node

#open-source #virtual-assistant #speech-to-text

cjpais/Handy

A free, open-source, and extensible speech-to-text application that works offline.

16.9K

Active

TypeScript

React

#speech-to-text #accessibility #offline

jianchang512/pyvideotrans

A Python library that translates videos from one language to another, with support for dubbing and subtitles.

16.4K

Active

Python

AI Voice & Speech

#speech-to-text #text-to-speech #video-translation

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K

Stable

Shell

Speech Recognition

#speech-recognition #speaker-identification #speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition #voice-recognition #offline

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition #audio-processing #deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text #text-to-speech #offline

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K

Active

Rust

LLM Frameworks

#ai-meeting-assistant #transcription #speaker-diarization

kyutai-labs/moshi

Moshi is an open-source speech-text foundation model and full-duplex spoken dialogue framework for building AI-powered voice apps.

9.8K

Active

Python

LLM Frameworks

Python

#speech-to-text #dialogue-framework #audio-codec

QuentinFuxa/WhisperLiveKit

Simultaneous speech-to-text built on Whisper for real-time transcription.

9.8K

Active

Python

AI Voice & Speech

Python

#speech-to-text #transcription #whisper

KoljaB/RealtimeSTT

A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.

9.5K

Experimental

Python

AI Voice & Speech

#realtime #speech-to-text #voice-activity-detection

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K

Active

Python

AI Voice & Speech

Python

#speech-recognition #speech-to-text #audio

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K

Stable

Python

AI Voice & Speech

API Frameworks

TensorFlow

#speech-recognition #speech-to-text #chinese

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-recognition #speech-emotion-recognition #audio-event-classification

TalAter/annyang

Speech recognition library for your web application, enabling voice interactions.

6.7K

Archived

JavaScript

Component Libraries (React)

AI Voice & Speech

React

#speech #speech-recognition #voice

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon #speech-recognition #speech-synthesis
