Explore Projects

Discover 49 open source projects

Search: asr

Showing 1-20 of 49 projects

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K

Archived

Python

LLM Frameworks

RAG & Vector

#nlp #chinese-nlp #ai-resources

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

#asr #speech-to-text #diarization

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K

Active

Python

React

#generative-ai #machine-learning #neural-networks

kaldi-asr/kaldi

An open-source toolkit for building speech recognition systems.

15.3K

Stable

Shell

Speech Recognition

#speech-recognition #speaker-identification #speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition #voice-recognition #offline

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

#speech-recognition #speech-synthesis #speech-translation

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition #audio-processing #deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text #text-to-speech #offline

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K

Archived

C++

Inference

#speech-recognition #whisper #asr

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-recognition #speech-emotion-recognition #audio-event-classification

wzpan/wukong-robot

An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.

7.1K

Archived

Python

AI Voice & Speech

Agents & Orchestration

#chatgpt #voice-assistant #brain-computer-interface

moonshine-ai/moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

7.0K

Stable

Python

AI Voice & Speech

#speech-recognition #edge-computing #audio-processing

jdepoix/youtube-transcript-api

A Python API to get YouTube video transcripts without an API key or headless browser

7.0K

Active

Python

API Clients & Testing

Video

#youtube #transcripts #captions

xiangyuecn/Recorder

HTML5 JavaScript recording library that supports multiple audio formats and provides features like ASR and DTMF.

5.6K

Experimental

JavaScript

Recording & Streaming

Speech & Voice

#audio #recording #webrtc

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr #speaker-diarization #speech-recognition

PeterH0323/Streamer-Sales

A large language model-powered virtual salesperson that can generate product descriptions to drive user purchases.

3.6K

Experimental

Python

LLM Frameworks

Agents & Orchestration

FastAPI

#llm #virtual-salesperson #product-descriptions

umlx5h/LLPlayer

A media player for language learning with AI-powered features like dual subtitles, real-time translation, and more.

3.3K

Active

LLM Wrappers & SDKs

Media Players

WPF

#language-learning #media-player #ai-subtitles

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K

Stable

Python

AI Voice & Speech

API Clients & Testing

Flask

#speech-recognition #automatic-speech-recognition #openai-whisper

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K

Archived

AI Voice & Speech

#speech-recognition #speech-synthesis #language-modeling

CheshireCC/faster-whisper-GUI

A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.

2.9K

Archived

Python

AI Voice & Speech

AI App Builders

PySide6

#speech-transcription #openai-whisper #voice-activity-detection
