Explore Projects

Discover 69 open source projects

Active filters (1):
Search: speech-to-textร—
Clear all

Showing 41-60 of 69 projects

bugbakery/audapolis

An editor for spoken-word audio with automatic transcription, focused on the needs of vibe coders.

1.8K
Active
TypeScript
AI Voice & Speech
Audio Editing
React
#audio-editing#speech-to-text#transcription

mesolitica/NLP-Models-Tensorflow

A repository of machine learning and Tensorflow deep learning models for natural language processing problems.

1.8K
Archived
Jupyter Notebook
LLM Frameworks
Computer Vision
Tensorflow
#natural-language-processing#deep-learning#machine-learning

ideasman42/nerd-dictation

Simple, hackable offline speech-to-text tool using the VOSK-API, useful for vibe coders building AI apps.

1.8K
Stable
Python
AI Voice & Speech
CLI Tools
#speech-to-text#offline#hackable

kalliope-project/kalliope

Kalliope is a Python framework for creating your own personal assistant with speech recognition and synthesis.

1.8K
Archived
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-synthesis#personal-assistant

iamsrikanthnani/pluely

An open-source AI assistant that works seamlessly in meetings, interviews, and conversations without detection.

1.6K
Active
TypeScript
AI Assistants
Desktop Apps
React
#ai-assistant#undetectable#privacy-first

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

OpenWhispr/openwhispr

Cross-platform voice-to-text app with local & cloud AI models, privacy-first architecture

1.6K
Active
TypeScript
Desktop Model Runners
AI Voice & Speech
Whisper
#voice-to-text#local-inference#whisper-api

Olcmyk/HuChenFeng

A collection of content from a Chinese internet personality, including livestream transcripts and text analysis.

1.5K
Stable
Speech-to-Text
Text Analysis
#archiving#content-analysis#livestream

antirez/voxtral.c

Pure C inference engine for Mistral Voxtral 4B speech-to-text model with minimal dependencies

1.5K
Active
C
Local Inference Engines
Inference
C
#speech-to-text#voxtral#c-inference

AlekPet/ComfyUI_Custom_Nodes_AlekPet

A collection of custom nodes that extend the capabilities of the ComfyUI AI coding tool.

1.5K
Active
JavaScript
AI Code Editors
LLM Frameworks
React
#comfyui#stable-diffusion#pose-detection

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K
Archived
Python
AI Voice & Speech
API Frameworks
#speech-recognition#natural-language-processing#ubuntu

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K
Active
C++
AI Voice & Speech
Realtime
#live-streaming#realtime-transcription#speech-recognition

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K
Active
C++
AI Voice & Speech
API Frameworks
#speech-recognition#speech-synthesis#machine-translation

R3gm/SoniTranslate

Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.

1.3K
Stable
Python
AI Voice & Speech
CMS & Content
#video-dubbing#speech-to-text#text-to-speech

altic-dev/FluidVoice

macOS offline speech-to-text app using local MLโ€”no cloud, fully private voice dictation

1.3K
Active
Swift
Desktop Model Runners
AI Voice & Speech
Swift
#offline-dictation#voice-to-text#local-inference

Robitx/gp.nvim

A Neovim AI plugin that enables ChatGPT sessions, Instructable text/code operations, and Speech to Text functionality.

1.3K
Stable
Lua
LLM Wrappers & SDKs
AI Code Editors
Neovim
#neovim#chatgpt#speech-to-text

Capsize-Games/airunner

Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting

1.3K
Stable
Python
Inference
AI Image & Video
PyGame
#ai#image-generation#chatbot

sdkcarlos/artyom.js

A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.

1.3K
Archived
JavaScript
AI Voice & Speech
Frontend Frameworks
JavaScript
#speech-recognition#speech-synthesis#voice-commands

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-translation#speech-synthesis

Stay in the loop

Get weekly updates on trending AI coding tools and projects.