Showing 1-20 of 84 projects
Model framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.
High-performance C/C++ port of OpenAI's Whisper for speech recognition
Offline speech-to-text engine for real-time on-device use
Faster Whisper transcription with CTranslate2 for efficient speech-to-text
WhisperX for fast ASR with word-level timestamps and diarization
An open-source personal assistant that provides AI-powered voice and text interactions.
An open-source speech recognition toolkit used for building speech recognition systems.
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
A collection of state-of-the-art deep learning scripts for various AI/ML tasks, easily trainable and deployable.
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
A comprehensive collection of deep learning, reinforcement learning, and machine learning resources for vibe coders.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.
End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.
Python speech recognition library supporting multiple engines and APIs, both online and offline.
A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.
A multilingual voice understanding model for AI-powered audio analysis and transcription.
Speech recognition library for your web application, enabling voice interactions.
Facebook AI Research's end-to-end speech recognition toolkit written in C++.
Get weekly updates on trending AI coding tools and projects.