Explore Projects

Discover 75 open source projects

Active filters (1):
Search: whisper×
Clear all

Showing 1-20 of 75 projects

openai/whisper

Robust speech recognition model for multilingual tasks

95.5K
Stable
Python
AI Voice & Speech
PyTorch
#speech-recognition#multilingual#audio-processing

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K
Active
C++
Inference
CLI Tools
#speech-to-text#c++#inference

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K
Stable
Python
Inference
Local Inference Engines
CTranslate2
#speech-to-text#inference#quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K
Active
Python
AI Voice & Speech
Python
#asr#speech-to-text#diarization

chidiwilliams/buzz

A Python library that uses OpenAI's Whisper to enable offline audio transcription and translation.

18.1K
Active
Python
LLM Wrappers & SDKs
#whisper#transcription#translation

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K
Active
Python
AI Voice & Speech
PyTorch
#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-translation

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K
Active
TypeScript
Voice AI & Synthesis
Whisper
#qwen3-tts#voice-ai#mlx

sashabaranov/go-openai

Go client for OpenAI's ChatGPT, GPT-5, DALL-E, and Whisper APIs, enabling AI-powered applications in Go.

10.6K
Stable
Go
LLM Wrappers & SDKs
Go
#chatgpt#gpt-4#gpt-5

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K
Archived
C++
Inference
#speech-recognition#whisper#asr

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K
Active
Rust
LLM Frameworks
#ai-meeting-assistant#transcription#speaker-diarization

QuentinFuxa/WhisperLiveKit

A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.

9.8K
Active
Python
AI Voice & Speech
Python
#speech-to-text#transcription#whisper

niedev/RTranslator

Open source real-time translation app for Android that runs locally using AI models.

9.7K
Active
C++
LLM Wrappers & SDKs
Android
#translation#offline#realtime

xorbitsai/inference

Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.

9.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#artificial-intelligence#llm#inference

Vaibhavs10/insanely-fast-whisper

An open-source library for running the Whisper AI speech recognition model efficiently on a variety of platforms.

8.8K
Stable
Jupyter Notebook
LLM Wrappers & SDKs
API Frameworks
React
#speech-recognition#whisper#llm

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K
Stable
Python
LLM Frameworks
File Storage
Python
#ingestion-api#ocr#parser-library

ddean2009/MoneyPrinterPlus

Automatically generates short videos with AI LLM and publishes them to multiple platforms.

5.8K
Experimental
Python
LLM Frameworks
BaaS Platforms
React
#authentication#streaming#real-time

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K
Active
Swift
AI Voice & Speech
iOS
#speech-recognition#transformers#inference

ConnectAI-E/feishu-openai

A Go-based framework for building chatbots and AI-powered assistants using Feishu (Lark) and OpenAI's GPT-4 models.

5.6K
Experimental
Go
LLM Frameworks
AI Coding Agents
Go
#chatgpt#openai#feishu-bot

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr#speaker-diarization#speech-recognition

Stay in the loop

Get weekly updates on trending AI coding tools and projects.