Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech×

Showing 161-180 of 368 projects

VITA-MLLM/VITA

A powerful multimodal AI model for real-time vision and speech interaction, built for developers who work with AI tools.

2.5K

Experimental

Python

LLM Frameworks

Agents & Orchestration

Python

#large-language-model#multimodal#video-understanding

nateshmbhat/pyttsx3

Offline Text-to-Speech library for Python developers to add speech synthesis to their applications.

2.5K

Stable

Python

AI Voice & Speech

#text-to-speech#python#offline

wiseman/py-webrtcvad

A Python interface to the WebRTC Voice Activity Detector, a library for detecting speech in audio signals.

2.4K

Archived

API Frameworks

AI Voice & Speech

#audio-processing#voice-activity-detection#webrtc

jameslyons/python_speech_features

This Python library provides common speech feature extraction functions for automatic speech recognition (ASR) tasks.

2.4K

Archived

Python

AI Voice & Speech

#speech-recognition#feature-extraction#mfcc

CjangCjengh/MoeGoe

Executable file for VITS inference, a neural text-to-speech model for generating high-quality speech.

2.4K

Archived

Python

LLM Frameworks

Inference

Python

#text-to-speech#neural-network#inference

explosion/spacy-course

A free online course teaching advanced NLP with spaCy

2.4K

Experimental

Python

Tutorials & Courses

Gatsby

#NLP#spaCy#online-course

thewh1teagle/kokoro-onnx

An open-source Python library for building text-to-speech (TTS) applications using the Kokoro engine and ONNX runtime.

2.4K

Active

Python

AI Voice & Speech

Python

#tts#text-to-speech#kokoro

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K

Archived

Python

Speech Recognition

API Frameworks

PyTorch

#speech-recognition#deep-learning#kaldi

r9y9/wavenet_vocoder

A high-quality open-source PyTorch implementation of the WaveNet vocoder, a neural network for speech synthesis.

2.4K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-synthesis#neural-vocoder#wavenet

cogentapps/chat-with-gpt

An open-source ChatGPT app with a voice, built using TypeScript.

2.4K

Archived

TypeScript

React

#ChatGPT#AI-powered chatbot#self-hosted

NVIDIA/waveglow

NVIDIA's WaveGlow is a flow-based generative network for high-quality speech synthesis in Python.

2.3K

Archived

Python

Speech

API Frameworks

#speech-synthesis#generative-models#audio-processing

earlephilhower/ESP8266Audio

An Arduino library to play various audio formats on ESP8266 and ESP32 devices with I2S DACs or software-emulated delta-sigma DACs.

2.3K

Stable

API Frameworks

Arduino & Embedded

#audio#dac#esp8266

Rayhane-mamah/Tacotron-2

Tacotron-2 is a state-of-the-art text-to-speech model that vibe coders can use to build speech synthesis applications.

2.3K

Archived

Python

Speech Synthesis

API Frameworks

TensorFlow

#speech-synthesis#text-to-speech#neural-networks

sindresorhus/awesome-whisper

An awesome list for Whisper, an open-source AI-powered speech recognition system by OpenAI.

2.2K

Stable

LLM Frameworks

AI Voice & Speech

#speech-to-text#transcription#openai

lifeiteng/vall-e

A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.

2.2K

Stable

Python

LLM Frameworks

AI Voice & Speech

PyTorch

#chatgpt#in-context-learning#large-language-models

resemble-ai/resemble-enhance

An AI-powered speech denoising and enhancement library for improving audio quality.

2.2K

Archived

Python

AI Voice & Speech

Python

#denoise#speech-denoising#speech-enhancement

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#speech-synthesis#deep-learning

m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video using Whisper, a powerful AI speech recognition model.

2.2K

Archived

Python

AI Voice & Speech

Subtitles & Captions

Python

#ffmpeg#openai-whisper#subtitle-generator

fatchord/WaveRNN

A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.

2.2K

Archived

Python

AI Voice & Speech

PyTorch

#neural-vocoder#speech-synthesis#text-to-speech

pannous/tensorflow-speech-recognition

A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.

2.2K

Archived

Python

Speech Recognition

API Frameworks

TensorFlow

#speech-recognition#audio-processing#deep-learning

1...810...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.