Showing 21-40 of 69 projects
Pre-trained text-to-speech models for various languages, made simple to use.
Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper
Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities
A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.
A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.
Open-source and modular AI-powered speech-to-speech translation tool built with Python.
A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.
A high-quality end-to-end speech interaction model for AI-powered voice applications.
Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools
An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.
A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.
AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.
Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.
An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.
An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.
Real-time audio/speech translation tool for Windows LiveCaptions
An awesome list for Whisper, an open-source AI-powered speech recognition system by OpenAI.
A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.
An open-source, privacy-first desktop voice assistant that integrates local speech recognition and configurable language models.
A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.
Get weekly updates on trending AI coding tools and projects.