Explore Projects

Discover 13 open source projects

Active filters (1):

Search: kaldi×

Clear all

Showing 1-13 of 13 projects

kaldi-asr/kaldi

An open-source speech recognition toolkit used for building speech recognition systems.

15.3K

Stable

Shell

Speech Recognition

#speech-recognition#speaker-identification#speaker-verification

alphacep/vosk-api

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

14.3K

Stable

Jupyter Notebook

AI Voice & Speech

Node

#speech-recognition#voice-recognition#offline

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text#text-to-speech#offline

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K

Active

Python

Speech & Voice

PyTorch

#speech-recognition#speech-synthesis#speech-translation

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K

Archived

Python

Speech Recognition

API Frameworks

PyTorch

#speech-recognition#deep-learning#kaldi

MontrealCorpusTools/Montreal-Forced-Aligner

Command line tool for forced alignment using the Kaldi speech recognition toolkit.

1.8K

Active

Python

Speech & Audio

CLI Tools

Python

#speech-recognition#forced-alignment#kaldi

k2-fsa/sherpa-ncnn

Real-time speech recognition and voice activity detection for offline use on multiple platforms.

1.6K

Stable

C++

AI Voice & Speech

Cross-Platform

#speech-recognition#voice-activity-detection#offline

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K

Archived

Python

AI Voice & Speech

API Frameworks

#speech-recognition#natural-language-processing#ubuntu

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K

Experimental

Python

AI Voice & Speech

BaaS Platforms

Python

#speech-recognition#asr#kaldi

lhotse-speech/lhotse

Lhotse is a set of tools for handling multimodal data in machine learning projects, with a focus on speech and audio.

1.1K

Active

Python

Speech & Voice

Data Pipelines

PyTorch

#speech-recognition#audio-processing#data-handling

alumae/kaldi-gstreamer-server

A real-time speech recognition server built with the Kaldi toolkit and GStreamer framework.

1.1K

Archived

Python

AI Voice & Speech

#speech-recognition#real-time#open-source

pykaldi/pykaldi

A Python wrapper for Kaldi speech recognition and feature extraction library.

1.0K

Stable

Python

Kaldi

React

#speech-recognition#feature-extraction#kaldi

alphacep/vosk-android-demo

Offline speech recognition for Android using the Vosk library, a popular open-source speech recognition toolkit.

1.0K

Stable

Java

AI Voice & Speech

Android

#speech-recognition#offline#android

Stay in the loop

Get weekly updates on trending AI coding tools and projects.