Showing 141-160 of 368 projects
This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.
A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.
A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.
AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.
Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.
Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.
Code for a demo of the OpenAI Speech API, allowing developers to explore and build speech-enabled applications.
A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).
An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.
Converse with ChatGPT using a web application built on top of SpeechGPT
Offline private voice assistant for many human languages, built with privacy and security in mind.
A fast, on-device, multilingual text-to-speech (TTS) library running natively via ONNX.
A Python library and CLI tool to interface with Google Translate's text-to-speech API.
A Python-based tool that enables easy deployment of ChatTTS, supporting features like streaming output, voice selection, and multi-character reading.
A speech recognition library for the web, allowing developers to build AI-powered applications.
An open-source, multilingual text-to-speech synthesis system written in pure Java.
An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.
A PyTorch-based audio source separation toolkit for researchers and developers working on AI audio applications.
A toolkit for self-supervised speech pre-training and representation learning.
Real-time audio/speech translation tool for Windows LiveCaptions
Get weekly updates on trending AI coding tools and projects.