Showing 1-20 of 75 projects
Robust speech recognition model for multilingual tasks
High-performance C/C++ port of OpenAI's Whisper for speech recognition
Faster Whisper transcription with CTranslate2 for efficient speech-to-text
WhisperX for fast ASR with word-level timestamps and diarization
A Python library that uses OpenAI's Whisper to enable offline audio transcription and translation.
A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
Open-source voice synthesis studio powered by Qwen3-TTS
Go client for OpenAI's ChatGPT, GPT-5, DALL-E, and Whisper APIs, enabling AI-powered applications in Go.
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.
A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.
Open source real-time translation app for Android that runs locally using AI models.
Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.
An open-source library for running the Whisper AI speech recognition model efficiently on a variety of platforms.
A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.
Automatically generates short videos with AI LLM and publishes them to multiple platforms.
An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.
A Go-based framework for building chatbots and AI-powered assistants using Feishu (Lark) and OpenAI's GPT-4 models.
Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper
Get weekly updates on trending AI coding tools and projects.