Explore Projects

Discover 49 open source projects

Active filters (1):
Search: asr×
Clear all

Showing 1-20 of 49 projects

fighting41love/funNLP

Comprehensive Chinese NLP resource collection for developers

79.2K
Archived
Python
LLM Frameworks
RAG & Vector
Python
#nlp#chinese-nlp#ai-resources

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
Python
#asr#speech-to-text#diarization

NVIDIA-NeMo/NeMo

A scalable generative AI framework for researchers and developers

16.9K
Active
Python
React
#generative-ai#machine-learning#neural-networks

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition#speaker-identification#speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition#voice-recognition#offline

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-translation

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K
Archived
C++
Inference
#speech-recognition#whisper#asr

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-recognition#speech-emotion-recognition#audio-event-classification

wzpan/wukong-robot

An open-source Chinese voice assistant project that supports ChatGPT-like conversational abilities and brain-computer interface integration.

7.1K
Archived
Python
AI Voice & Speech
Agents & Orchestration
Python
#chatgpt#voice-assistant#brain-computer-interface

moonshine-ai/moonshine

Fast and accurate automatic speech recognition (ASR) for edge devices

7.0K
Stable
Python
AI Voice & Speech
#speech-recognition#edge-computing#audio-processing

jdepoix/youtube-transcript-api

A Python API to get YouTube video transcripts without an API key or headless browser

7.0K
Active
Python
API Clients & Testing
Video
#youtube#transcripts#captions

xiangyuecn/Recorder

HTML5 JavaScript recording library that supports multiple audio formats and provides features like ASR and DTMF.

5.6K
Experimental
JavaScript
Recording & Streaming
Speech & Voice
#audio#recording#webrtc

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

PeterH0323/Streamer-Sales

A large language model-powered virtual salesperson that can generate product descriptions to drive user purchases.

3.6K
Experimental
Python
LLM Frameworks
Agents & Orchestration
FastAPI
#llm#virtual-salesperson#product-descriptions

umlx5h/LLPlayer

A media player for language learning with AI-powered features like dual subtitles, real-time translation, and more.

3.3K
Active
C#
LLM Wrappers & SDKs
Media Players
WPF
#language-learning#media-player#ai-subtitles

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K
Stable
Python
AI Voice & Speech
API Clients & Testing
Flask
#speech-recognition#automatic-speech-recognition#openai-whisper

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K
Archived
AI Voice & Speech
#speech-recognition#speech-synthesis#language-modeling

CheshireCC/faster-whisper-GUI

A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.

2.9K
Archived
Python
AI Voice & Speech
AI App Builders
PySide6
#speech-transcription#openai-whisper#voice-activity-detection

Stay in the loop

Get weekly updates on trending AI coding tools and projects.