Explore Projects

Discover 75 open source projects

Active filters (1):

Search: whisper×

Clear all

Showing 1-20 of 75 projects

openai/whisper

Robust speech recognition model for multilingual tasks

95.5K

Stable

Python

AI Voice & Speech

PyTorch

#speech-recognition#multilingual#audio-processing

ggml-org/whisper.cpp

High-performance C/C++ port of OpenAI's Whisper for speech recognition

47.2K

Active

C++

Inference

CLI Tools

#speech-to-text#c++#inference

SYSTRAN/faster-whisper

Faster Whisper transcription with CTranslate2 for efficient speech-to-text

21.3K

Stable

Python

Inference

Local Inference Engines

CTranslate2

#speech-to-text#inference#quantization

m-bain/whisperX

WhisperX for fast ASR with word-level timestamps and diarization

20.5K

Active

Python

AI Voice & Speech

Python

#asr#speech-to-text#diarization

chidiwilliams/buzz

A Python library that uses OpenAI's Whisper to enable offline audio transcription and translation.

18.1K

Active

Python

LLM Wrappers & SDKs

#whisper#transcription#translation

modelscope/FunASR

A comprehensive speech recognition toolkit with state-of-the-art pretrained models for various speech tasks.

15.1K

Active

Python

AI Voice & Speech

PyTorch

#speech-recognition#voice-activity-detection#audio-visual-speech-recognition

PaddlePaddle/PaddleSpeech

Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.

12.5K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-translation

jamiepine/voicebox

Open-source voice synthesis studio powered by Qwen3-TTS

12.2K

Active

TypeScript

Voice AI & Synthesis

Whisper

#qwen3-tts#voice-ai#mlx

sashabaranov/go-openai

Go client for OpenAI's ChatGPT, GPT-5, DALL-E, and Whisper APIs, enabling AI-powered applications in Go.

10.6K

Stable

LLM Wrappers & SDKs

#chatgpt#gpt-4#gpt-5

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K

Archived

C++

Inference

#speech-recognition#whisper#asr

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K

Active

Rust

LLM Frameworks

#ai-meeting-assistant#transcription#speaker-diarization

QuentinFuxa/WhisperLiveKit

A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.

9.8K

Active

Python

AI Voice & Speech

Python

#speech-to-text#transcription#whisper

niedev/RTranslator

Open source real-time translation app for Android that runs locally using AI models.

9.7K

Active

C++

LLM Wrappers & SDKs

Android

#translation#offline#realtime

xorbitsai/inference

Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.

9.1K

Active

Python

LLM Frameworks

Inference

PyTorch

#artificial-intelligence#llm#inference

Vaibhavs10/insanely-fast-whisper

An open-source library for running the Whisper AI speech recognition model efficiently on a variety of platforms.

8.8K

Stable

Jupyter Notebook

LLM Wrappers & SDKs

API Frameworks

React

#speech-recognition#whisper#llm

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K

Stable

Python

LLM Frameworks

File Storage

Python

#ingestion-api#ocr#parser-library

ddean2009/MoneyPrinterPlus

Automatically generates short videos with AI LLM and publishes them to multiple platforms.

5.8K

Experimental

Python

LLM Frameworks

BaaS Platforms

React

#authentication#streaming#real-time

argmaxinc/WhisperKit

An open-source on-device speech recognition library for Apple Silicon devices, built with Swift and Transformers.

5.7K

Active

Swift

AI Voice & Speech

iOS

#speech-recognition#transformers#inference

ConnectAI-E/feishu-openai

A Go-based framework for building chatbots and AI-powered assistants using Feishu (Lark) and OpenAI's GPT-4 models.

5.6K

Experimental

LLM Frameworks

AI Coding Agents

#chatgpt#openai#feishu-bot

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K

Stable

Jupyter Notebook

React

#asr#speaker-diarization#speech-recognition

2 3 4

Stay in the loop

Get weekly updates on trending AI coding tools and projects.