Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.
A near-real-time implementation of OpenAI's Whisper, a powerful speech recognition and translation tool.
A comprehensive TypeScript library for working with Chinese characters, including features like pinyin, stroke, and voice recognition.
An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.
A React Native library for voice recognition on iOS and Android, with online and offline support.
A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.
This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.