A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.
PaddleX is an all-in-one development tool based on PaddlePaddle, providing AI pipelines for computer vision, NLP, and more.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper
Open-source video speech recognition and clipping tool with LLM-based clipping capabilities.
Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.
A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.
A small speech recognition library written in C that can be used in a variety of applications.
A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools
A fully automated audio and video translation project that uses Whisper for speech recognition and AI models for subtitling.
A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.
Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.
An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.
Offline private voice assistant for many human languages, built with privacy and security in mind.
An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.
A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.
A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.
A React Native library for voice recognition on iOS and Android, with online and offline support.