Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech×

Showing 141-160 of 368 projects

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#video-translation#whisper

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K

Stable

Svelte

AI Voice & Speech

Frontend Frameworks

Svelte

#speech-recognition#speech-to-text#transcription

CheshireCC/faster-whisper-GUI

A GUI for the faster-whisper library, which enables fast, open-source speech transcription using OpenAI's Whisper model.

2.9K

Archived

Python

AI Voice & Speech

AI App Builders

PySide6

#speech-transcription#openai-whisper#voice-activity-detection

tmoroney/auto-subs

AI subtitle generator for video with DaVinci Resolve integration, speaker diarization, runs locally.

2.9K

Active

TypeScript

AI Voice & Speech

Desktop Model Runners

OpenAI

#ai-subtitles#davinci-resolve#speaker-diarization

Purfview/whisper-standalone-win

Standalone Windows executables for Whisper speech-to-text & diarization without Python setup.

2.9K

Stable

Desktop Model Runners

AI Voice & Speech

Whisper

#speech-to-text#whisper#faster-whisper

elevenlabs/elevenlabs-python

Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.

2.9K

Active

Python

AI SDKs & Wrappers

AI Voice & Speech

Python

#text-to-speech#voice-synthesis#elevenlabs-api

openai/openai-fm

Code for a demo of the OpenAI Speech API, allowing developers to explore and build speech-enabled applications.

2.8K

Stable

TypeScript

AI Voice & Speech

API Clients & Testing

TypeScript

#openai#speech-api#demo

readbeyond/aeneas

A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).

2.8K

Archived

Python

Audio & Speech

CLI Tools

Python

#audio#text-to-speech#alignment

linto-ai/whisper-timestamped

An open-source library for multilingual automatic speech recognition with word-level timestamps and confidence.

2.8K

Stable

Python

AI Voice & Speech

CLI Tools

PyTorch

#speech-recognition#multilingual#transformers

hahahumble/speechgpt

Converse with ChatGPT using a web application built on top of SpeechGPT

2.8K

Archived

TypeScript

React

#chatgpt#conversationalai#speechgpt

rhasspy/rhasspy

Offline private voice assistant for many human languages, built with privacy and security in mind.

2.7K

Experimental

Shell

API Frameworks

AI Voice & Speech

Node

#voice-assistant#speech-recognition#privacy

supertone-inc/supertonic

A fast, on-device, multilingual text-to-speech (TTS) library running natively via ONNX.

2.7K

Active

C++

AI Voice & Speech

API Frameworks

React

#text-to-speech#tts#on-device

pndurette/gTTS

A Python library and CLI tool to interface with Google Translate's text-to-speech API.

2.6K

Stable

Python

LLM Frameworks

React

#text-to-speech#speech-api#gtts

6drf21e/ChatTTS_colab

A Python-based tool that enables easy deployment of ChatTTS, supporting features like streaming output, voice selection, and multi-character reading.

2.6K

Archived

Python

AI Voice & Speech

AI App Builders

Colab

#text-to-speech#ai-voice#colab

zzmp/juliusjs

A speech recognition library for the web, allowing developers to build AI-powered applications.

2.6K

Archived

JavaScript

Prompt Engineering

React

#speech recognition#web development#AI-powered

marytts/marytts

An open-source, multilingual text-to-speech synthesis system written in pure Java.

2.6K

Archived

Java

AI Voice & Speech

#speech-synthesis#text-to-speech#tts

coqui-ai/STT

An open-source deep learning toolkit for building Speech-to-Text models and deploying them easily.

2.6K

Archived

C++

Speech Recognition

API Frameworks

TensorFlow

#speech-recognition#deep-learning#asr

asteroid-team/asteroid

A PyTorch-based audio source separation toolkit for researchers and developers working on AI audio applications.

2.5K

Stable

Python

Audio & Speech

PyTorch

#audio-separation#deep-learning#pretrained-models

s3prl/s3prl

A toolkit for self-supervised speech pre-training and representation learning.

2.5K

Experimental

Python

Speech Recognition

API Frameworks

Python

#speech-recognition#self-supervised-learning#representation-learning

SakiRinn/LiveCaptions-Translator

Real-time audio/speech translation tool for Windows LiveCaptions

2.5K

Active

Machine Learning & AI Editors

#LiveCaptions#Speech-to-Text#Translation

1...79...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.