Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 141-160 of 368 projects

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#video-translation#whisper

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K
Stable
Svelte
AI Voice & Speech
Frontend Frameworks
Svelte
#speech-recognition#speech-to-text#transcription

CheshireCC/faster-whisper-GUI

A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.

2.9K
Archived
Python
AI Voice & Speech
AI App Builders
PySide6
#speech-transcription#openai-whisper#voice-activity-detection

tmoroney/auto-subs

AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.

2.9K
Active
TypeScript
AI Voice & Speech
Desktop Model Runners
OpenAI
#ai-subtitles#davinci-resolve#speaker-diarization

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K
Stable
Desktop Model Runners
AI Voice & Speech
Whisper
#speech-to-text#whisper#faster-whisper

elevenlabs/elevenlabs-python

Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.

2.9K
Active
Python
AI SDKs & Wrappers
AI Voice & Speech
Python
#text-to-speech#voice-synthesis#elevenlabs-api

openai/openai-fm

Code for a demo of the OpenAI Speech API, allowing developers to explore and build speech-enabled applications.

2.8K
Stable
TypeScript
AI Voice & Speech
API Clients & Testing
TypeScript
#openai#speech-api#demo

readbeyond/aeneas

A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).

2.8K
Archived
Python
Audio & Speech
CLI Tools
Python
#audio#text-to-speech#alignment

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K
Stable
Python
AI Voice & Speech
CLI Tools
PyTorch
#speech-recognition#multilingual#transformers

hahahumble/speechgpt

Converse with ChatGPT using a web application built on top of SpeechGPT

2.8K
Archived
TypeScript
React
#chatgpt#conversationalai#speechgpt

rhasspy/rhasspy

Offline private voice assistant for many human languages, built with privacy and security in mind.

2.7K
Experimental
Shell
API Frameworks
AI Voice & Speech
Node
#voice-assistant#speech-recognition#privacy

supertone-inc/supertonic

A fast, on-device, multilingual text-to-speech (TTS) library running natively via ONNX.

2.7K
Active
C++
AI Voice & Speech
API Frameworks
React
#text-to-speech#tts#on-device

pndurette/gTTS

A Python library and CLI tool to interface with Google Translate's text-to-speech API.

2.6K
Stable
Python
LLM Frameworks
React
#text-to-speech#speech-api#gtts

6drf21e/ChatTTS_colab

A Python-based tool that enables easy deployment of ChatTTS, supporting features like streaming output, voice selection, and multi-character reading.

2.6K
Archived
Python
AI Voice & Speech
AI App Builders
Colab
#text-to-speech#ai-voice#colab

zzmp/juliusjs

A speech recognition library for the web, allowing developers to build AI-powered applications.

2.6K
Archived
JavaScript
Prompt Engineering
React
#speech recognition#web development#AI-powered

marytts/marytts

An open-source, multilingual text-to-speech synthesis system written in pure Java.

2.6K
Archived
Java
AI Voice & Speech
#speech-synthesis#text-to-speech#tts

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K
Archived
C++
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition#deep-learning#asr

asteroid-team/asteroid

A PyTorch-based audio source separation toolkit for researchers and developers working on AI audio applications.

2.5K
Stable
Python
Audio & Speech
PyTorch
#audio-separation#deep-learning#pretrained-models

s3prl/s3prl

A toolkit for self-supervised speech pre-training and representation learning.

2.5K
Experimental
Python
Speech Recognition
API Frameworks
Python
#speech-recognition#self-supervised-learning#representation-learning

SakiRinn/LiveCaptions-Translator

Real-time audio/speech translation tool for Windows LiveCaptions

2.5K
Active
C#
Machine Learning & AI Editors
#LiveCaptions#Speech-to-Text#Translation
1...79...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.