Explore Projects

Discover 8 open source projects

Active filters (1):

Search: automatic-speech-recognition×

Showing 1-8 of 8 projects

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K

Stable

Python

AI Voice & Speech

API Clients & Testing

Flask

#speech-recognition#automatic-speech-recognition#openai-whisper

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K

Archived

AI Voice & Speech

#speech-recognition#speech-synthesis#language-modeling

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K

Archived

C++

Speech Recognition

API Frameworks

TensorFlow

#speech-recognition#deep-learning#asr

TEN-framework/ten-vad

A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.

2.0K

Stable

AI Voice & Speech

API Frameworks

#audio#speech-processing#real-time

FireRedTeam/FireRedASR

Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, with outstanding singing lyrics recognition.

1.8K

Active

Python

Speech Recognition

API Frameworks

Python

#asr#speech-recognition#conformer

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K

Active

Swift

AI Voice & Speech

iOS

Swift

#text-to-speech#speech-to-text#voice-activity-detection

kakaobrain/pororo

PORORO is a powerful Python library that provides a wide range of neural models for natural language processing tasks.

1.3K

Archived

Python

LLM Frameworks

AI Voice & Speech

Python

#natural-language-processing#speech-recognition#speech-synthesis

TensorSpeech/TensorFlowASR

TensorFlowASR is an almost state-of-the-art automatic speech recognition library in TensorFlow 2 for vibe coders.

1.0K

Experimental

Python

Speech Recognition

Caching

TensorFlow

#automatic-speech-recognition#speech-to-text#streaming

Stay in the loop

Get weekly updates on trending AI coding tools and projects.