Showing 41-49 of 49 projects
SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.
Real-time voice interactive digital human with customizable appearance and voice, supporting voice cloning.
Open-source PyTorch implementation of an end-to-end automatic speech recognition (ASR) system.
Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.
A Java-based framework for building AI-powered productivity tools, including chatbots, drawing, knowledge management, and more.
An open-source speech interaction system that integrates ASR, LLM, and TTS models for voice-based applications.
Unofficial PyTorch implementation of the Conformer model for speech recognition tasks.
A Python wrapper for Kaldi speech recognition and feature extraction library.
Offline speech recognition for Android using the Vosk library, a popular open-source speech recognition toolkit.
Get weekly updates on trending AI coding tools and projects.