Explore Projects

Discover 45 open source projects

Active filters (1):
Search: transcriptร—
Clear all

Showing 21-40 of 45 projects

jianfch/stable-ts

A Python library for transcription, forced alignment, and audio indexing using OpenAI's Whisper model.

2.2K
Stable
Python
AI Voice & Speech
API Frameworks
#audio-transcription#forced-alignment#audio-indexing

bytedance/piano_transcription

An open-source research project for piano transcription, a key component in AI music generation.

1.9K
Archived
Python
Computer Vision
AI SDKs & Wrappers
Python
#music#audio#transcription

juanmc2005/diart

A Python package for building real-time, AI-powered audio applications like speaker diarization and voice activity detection.

1.9K
Experimental
Python
AI Voice & Speech
API Frameworks
#real-time#speaker-diarization#speaker-embedding

kadirnar/whisper-plus

WhisperPlus is a faster, smarter, and more capable AI-powered audio transcription library built on top of OpenAI's Whisper model.

1.9K
Stable
Python
LLM Wrappers & SDKs
CLI Tools
React
#audio-transcription#openai-whisper#ai-powered

jimmc414/onefilellm

A tool that makes it easy to scrape and ingest content from various sources like GitHub, arXiv, and YouTube for use with large language models.

1.9K
Stable
Python
LLM Frameworks
CLI Tools
Python
#llm#text-extraction#data-ingestion

Music-and-Culture-Technology-Lab/omnizart

A comprehensive music transcription library that can detect beat, chord, drum, vocal, and instrument components.

1.8K
Archived
Python
Music Information Retrieval
CLI Tools
Python
#music-transcription#beat-tracking#chord-detection

bugbakery/audapolis

An editor for spoken-word audio with automatic transcription, focused on the needs of vibe coders.

1.8K
Active
TypeScript
AI Voice & Speech
Audio Editing
React
#audio-editing#speech-to-text#transcription

Mentra-Community/MentraOS

An open-source smart glasses platform and SDK for streaming, transcription, and AI-powered interactions.

1.8K
Active
TypeScript
AI SDKs & Wrappers
Component Libraries (React)
React
#smart-glasses#wearable#conversational-ai

NotJoeMartinez/yt-fts

A command-line tool to search and download full transcripts of YouTube videos for semantic search and analysis.

1.8K
Active
Python
LLM Wrappers & SDKs
CLI Tools
Python
#youtube#full-text-search#semantic-search

kaixxx/noScribe

Cutting-edge AI-powered audio transcription tool with a user-friendly GUI and support for speaker identification.

1.8K
Active
Python
LLM Frameworks
AI Voice & Speech
Python
#audio-transcription#speaker-identification#whisper

Vexa-ai/vexa

Self-hosted, multi-user API that drops bots into Google Meet for real-time transcripts.

1.7K
Active
Python
Collaboration & Real-time
Agents & Orchestration
Python
#google-meet#meeting-assistant#meeting-transcripts

hrishioa/lumentis

Generates one-click comprehensive documentation from transcripts and text using AI-powered tools.

1.7K
Experimental
TypeScript
React
#authentication#AI-powered#documentation

magenta/mt3

MT3 is a Python library for multi-task multitrack music transcription, a powerful tool for audio analysis.

1.7K
Active
Python
LLM Frameworks
API Frameworks
Python
#audio-analysis#music-transcription#multi-track

beyondcode/writeout.ai

Free transcription and translation service for audio files, built with PHP.

1.5K
Archived
PHP
File Storage
API Clients & Testing
#transcription#translation#audio-processing

Olcmyk/HuChenFeng

A collection of content from a Chinese internet personality, including livestream transcripts and text analysis.

1.5K
Stable
Speech-to-Text
Text Analysis
#archiving#content-analysis#livestream

azuwis/pianotrans

A simple GUI for ByteDance's Piano Transcription with Pedals, built using the Nix programming language.

1.4K
Active
Nix
Computer Vision
Music
#piano#transcription#ai-powered

hungtraan/FacebookBot

A Facebook Messenger Bot with voice recognition, NLP, and features like restaurant search and memo transcription.

1.4K
Archived
JavaScript
AI Voice & Speech
API Frameworks
Node
#voice-recognition#natural-language-processing#restaurant-search

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K
Active
C++
AI Voice & Speech
Realtime
#live-streaming#realtime-transcription#speech-recognition

finnvoor/yap

A CLI for on-device speech transcription using Speech.framework on macOS

1.4K
Stable
Swift
AI Code Editors
MCP Frameworks
React
#speech-transcription#on-device#macOS

discord-tickets/bot

A popular open-source Discord ticket management bot with features like ticket transcripts and self-hosting.

1.4K
Active
JavaScript
API Frameworks
GitHub Profiles
Discord.js
#discord-bot#ticket-management#discord-js

Stay in the loop

Get weekly updates on trending AI coding tools and projects.