Explore Projects

Discover 84 open source projects

Active filters (1):
Search: speech-recognitionร—
Clear all

Showing 21-40 of 84 projects

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

PaddlePaddle/PaddleX

PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.

6.1K
Active
Python
Computer Vision
Natural Language Processing
Python
#computer-vision#natural-language-processing#ocr

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K
Active
Swift
AI Voice & Speech
iOS
#speech-recognition#transformers#inference

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

modelscope/FunClip

Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities

5.4K
Experimental
Python
LLM Frameworks
AI Voice & Speech
gradio
#speech-recognition#video-subtitles#llm

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K
Active
Python
AI Voice & Speech
CLI Tools
Python
#speech-recognition#voice-activation#wake-word-detection

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.

4.7K
Archived
Jupyter Notebook
LLM Frameworks
Speech-to-Text
JAX
#speech-recognition#whisper#jax

cmusphinx/pocketsphinx

A small speech recognition library written in C that can be used in a variety of applications.

4.3K
Active
C
AI Voice & Speech
#speech-recognition#c#voice

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K
Stable
Python
AI Voice & Speech
API Clients & Testing
Flask
#speech-recognition#automatic-speech-recognition#openai-whisper

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K
Archived
AI Voice & Speech
#speech-recognition#speech-synthesis#language-modeling

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K
Experimental
C
AI Voice & Speech
AI App Builders
ESP-IDF
#alexa#google-home#speech-recognition

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#video-translation#whisper

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K
Stable
Svelte
AI Voice & Speech
Frontend Frameworks
Svelte
#speech-recognition#speech-to-text#transcription

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text#whisper#faster-whisper

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

rhasspy/rhasspy

Offline private voice assistant for many human languages, built with privacy and security in mind.

2.7K
Experimental
Shell
API Frameworks
AI Voice & Speech
Node
#voice-assistant#speech-recognition#privacy

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K
Archived
C++
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition#deep-learning#asr

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K
Archived
Python
Speech Recognition
API Frameworks
PyTorch
#speech-recognition#deep-learning#kaldi

pannous/tensorflow-speech-recognition

A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.

2.2K
Archived
Python
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition#audio-processing#deep-learning

react-native-voice/voice

A React Native library for voice recognition on iOS and Android, with online and offline support.

2.2K
Active
TypeScript
React Native
AI Voice & Speech
React Native
#voice-recognition#speech-recognition#android

Stay in the loop

Get weekly updates on trending AI coding tools and projects.