Explore Projects

Discover 13 open-source projects

Active filters (1):
Search: kaldi

Showing 1-13 of 13 projects

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K
Stable
Shell
Speech Recognition
#speech-recognition #speaker-identification #speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K
Stable
Jupyter Notebook
AI Voice & Speech
Node
#speech-recognition #voice-recognition #offline

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text #text-to-speech #offline

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition #speech-synthesis #speech-translation

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K
Archived
Python
Speech Recognition
API Frameworks
PyTorch
#speech-recognition #deep-learning #kaldi

MontrealCorpusTools/Montreal-Forced-Aligner

Command line tool for forced alignment using the Kaldi speech recognition toolkit.

1.8K
Active
Python
Speech & Audio
CLI Tools
#speech-recognition #forced-alignment #kaldi

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K
Stable
C++
AI Voice & Speech
Cross-Platform
#speech-recognition #voice-activity-detection #offline

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K
Archived
Python
AI Voice & Speech
API Frameworks
#speech-recognition #natural-language-processing #ubuntu

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K
Experimental
Python
AI Voice & Speech
BaaS Platforms
#speech-recognition #asr #kaldi

lhotse-speech/lhotse

Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.

1.1K
Active
Python
Speech & Voice
Data Pipelines
PyTorch
#speech-recognition #audio-processing #data-handling

alumae/kaldi-gstreamer-server

A real-time speech recognition server built with the Kaldi toolkit and GStreamer framework.

1.1K
Archived
Python
AI Voice & Speech
#speech-recognition #real-time #open-source

pykaldi/pykaldi

A Python wrapper for the Kaldi speech recognition and feature extraction library.

1.0K
Stable
Python
Kaldi
React
#speech-recognition #feature-extraction #kaldi

alphacep/vosk-android-demo

Offline speech recognition for Android using the Vosk library, a popular open-source speech recognition toolkit.

1.0K
Stable
Java
AI Voice & Speech
Android
#speech-recognition #offline #android
