Explore Projects

Discover 69 open source projects

Active filters (1):
Search: speech-to-text

Showing 21-40 of 69 projects

snakers4/silero-models

Pre-trained text-to-speech models for various languages, made simple to use.

5.8K
Active
Jupyter Notebook
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech #speech-synthesis #pre-trained-models

MahmoudAshraf97/whisper-diarization

Automatic Speech Recognition with Speaker Diarization using OpenAI Whisper

5.4K
Stable
Jupyter Notebook
React
#asr #speaker-diarization #speech-recognition

modelscope/FunClip

Open-source video speech recognition & clipping tool with LLM-based AI clipping capabilities

5.4K
Experimental
Python
LLM Frameworks
AI Voice & Speech
gradio
#speech-recognition #video-subtitles #llm

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU for speech-to-text tasks.

4.7K
Archived
Jupyter Notebook
LLM Frameworks
Speech-to-Text
JAX
#speech-recognition #whisper #jax

gradio-app/fastrtc

A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.

4.5K
Active
JavaScript
AI Voice & Speech
Realtime
Python
#real-time #speech-to-text #text-to-speech

huggingface/speech-to-speech

Open-source and modular AI-powered speech-to-speech translation tool built with Python.

4.5K
Experimental
Python
Speech & Voice
API Frameworks
Python
#speech-recognition #speech-synthesis #speech-translation

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K
Stable
Python
AI Voice & Speech
API Clients & Testing
Flask
#speech-recognition #automatic-speech-recognition #openai-whisper

ictnlp/LLaMA-Omni

A high-quality end-to-end speech interaction model for AI-powered voice applications.

3.1K
Experimental
Python
LLM Frameworks
AI Voice & Speech
Python
#large-language-model #speech-interaction #speech-to-speech

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K
Experimental
C
AI Voice & Speech
AI App Builders
ESP-IDF
#alexa #google-home #speech-recognition

KevinWang676/Bark-Voice-Cloning

An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.

3.0K
Stable
Jupyter Notebook
AI Voice & Speech
#speech-to-text #voice-cloning #chinese-speech

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K
Stable
Svelte
AI Voice & Speech
Frontend Frameworks
Svelte
#speech-recognition #speech-to-text #transcription

tmoroney/auto-subs

AI subtitle generator for video with DaVinci Resolve integration and speaker diarization; runs locally.

2.9K
Active
TypeScript
AI Voice & Speech
Desktop Model Runners
OpenAI
#ai-subtitles #davinci-resolve #speaker-diarization

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text #whisper #faster-whisper

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence scores.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition #multilingual #transformers

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K
Archived
C++
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition #deep-learning #asr

SakiRinn/LiveCaptions-Translator

Real-time audio/speech translation tool for Windows LiveCaptions

2.5K
Active
C#
Machine Learning & AI Editors
#LiveCaptions #Speech-to-Text #Translation

sindresorhus/awesome-whisper

An awesome list for Whisper, an open-source AI-powered speech recognition system by OpenAI.

2.2K
Stable
LLM Frameworks
AI Voice & Speech
#speech-to-text #transcription #openai

pannous/tensorflow-speech-recognition

A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.

2.2K
Archived
Python
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition #audio-processing #deep-learning

yan5xu/ququ

An open-source, privacy-first desktop voice assistant that integrates local speech recognition and configurable language models.

2.0K
Stable
JavaScript
AI Voice & Speech
AI App Builders
Electron
#ai-voice-recognition #speech-to-text #local-processing

ricky0123/vad

A TypeScript library for a voice activity detector (VAD) with a simple API for the browser.

1.9K
Active
TypeScript
AI Voice & Speech
Frontend Frameworks
TypeScript
#speech-to-text #voice-activity-detection #web-audio-api
