Explore Projects

Discover 84 open source projects

Active filters (1):

Search: speech-recognition×

Clear all

Showing 21-40 of 84 projects

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K

Active

Python

AI Voice & Speech

CLI Tools

Apple MLX

#apple-silicon#speech-recognition#speech-synthesis

PaddlePaddle/PaddleX

PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.

6.1K

Active

Python

Computer Vision

Natural Language Processing

Python

#computer-vision#natural-language-processing#ocr

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K

Active

Swift

AI Voice & Speech

iOS

#speech-recognition#transformers#inference

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr#speaker-diarization#speech-recognition

modelscope/FunClip

Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities

5.4K

Experimental

Python

LLM Frameworks

AI Voice & Speech

gradio

#speech-recognition#video-subtitles#llm

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K

Active

Python

AI Voice & Speech

CLI Tools

Python

#speech-recognition#voice-activation#wake-word-detection

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.

4.7K

Archived

Jupyter Notebook

LLM Frameworks

Speech-to-Text

JAX

#speech-recognition#whisper#jax

cmusphinx/pocketsphinx

A small speech recognition library written in C that can be used in a variety of applications.

4.3K

Active

AI Voice & Speech

#speech-recognition#c#voice

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K

Stable

Python

AI Voice & Speech

API Clients & Testing

Flask

#speech-recognition#automatic-speech-recognition#openai-whisper

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K

Archived

AI Voice & Speech

#speech-recognition#speech-synthesis#language-modeling

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K

Experimental

AI Voice & Speech

AI App Builders

ESP-IDF

#alexa#google-home#speech-recognition

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#video-translation#whisper

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K

Stable

Svelte

AI Voice & Speech

Frontend Frameworks

Svelte

#speech-recognition#speech-to-text#transcription

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K

Stable

Desktop Model Runners

AI Voice & Speech

Whisper

#speech-to-text#whisper#faster-whisper

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#speech-recognition#multilingual#transformers

rhasspy/rhasspy

Offline private voice assistant for many human languages, built with privacy and security in mind.

2.7K

Experimental

Shell

API Frameworks

AI Voice & Speech

Node

#voice-assistant#speech-recognition#privacy

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K

Archived

C++

Speech Recognition

API Frameworks

TensorFlow

#speech-recognition#deep-learning#asr

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K

Archived

Python

Speech Recognition

API Frameworks

PyTorch

#speech-recognition#deep-learning#kaldi

pannous/tensorflow-speech-recognition

A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.

2.2K

Archived

Python

Speech Recognition

API Frameworks

TensorFlow

#speech-recognition#audio-processing#deep-learning

react-native-voice/voice

A React Native library for voice recognition on iOS and Android, with online and offline support.

2.2K

Active

TypeScript

React Native

AI Voice & Speech

React Native

#voice-recognition#speech-recognition#android

13 4 5

Stay in the loop

Get weekly updates on trending AI coding tools and projects.