Showing 181-200 of 368 projects
FastSpeech 2 implementation for high-quality end-to-end text-to-speech
A React Native library for voice recognition on iOS and Android, with online and offline support.
An open-source library for building speech recognition models using the DeepSpeech2 architecture.
An open-source, privacy-first desktop voice assistant that integrates local speech recognition and configurable language models.
A lightweight, high-performance voice activity detector (VAD) library for conversational AI and real-time speech processing.
A PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models.
An open-source text-to-speech tool supporting long-form text and multi-voice narration.
Mandarin Automatic Speech Recognition system using Python
A lightweight, open-source translator that supports multiple translation engines and features like OCR and text-to-speech.
Open-source large vocabulary continuous speech recognition engine for various applications.
A multi-functional OCR tool for text recognition, translation, text-to-speech, manga translation, and more.
Cross-modal lip reading using 3D convolutional neural networks for speech recognition.
NCRF++: A Neural Sequence Labeling Toolkit for tasks like NER, POS tagging, and text chunking.
A developer-focused platform for text-to-speech, RAG, and LLMs, with local-first architecture.
The Alan AI SDK for iOS enables developers to add voice AI and conversational interfaces to their mobile apps.
A real-time speech enhancement model that runs on a laptop CPU, useful for AI audio processing.
An open-source project towards developing a GPT-4-based AI assistant with vision, speech, and duplex capabilities.
An open-source library for tracking the state of the art and recent results in speech recognition research.
An open-source software for doing phonetics by computer, focused on speech analysis.
A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.
Get weekly updates on trending AI coding tools and projects.