Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 161-180 of 368 projects

VITA-MLLM/VITA

A powerful multimodal AI model for real-time vision and speech interaction, built for developers who work with AI tools.

2.5K
Experimental
Python
LLM Frameworks
Agents & Orchestration
Python
#large-language-model#multimodal#video-understanding

nateshmbhat/pyttsx3

Offline Text-to-Speech library for Python developers to add speech synthesis to their applications.

2.5K
Stable
Python
AI Voice & Speech
#text-to-speech#python#offline

wiseman/py-webrtcvad

A Python interface to the WebRTC Voice Activity Detector, a library for detecting speech in audio signals.

2.4K
Archived
C
API Frameworks
AI Voice & Speech
#audio-processing#voice-activity-detection#webrtc

jameslyons/python_speech_features

This Python library provides common speech feature extraction functions for automatic speech recognition (ASR) tasks.

2.4K
Archived
Python
AI Voice & Speech
#speech-recognition#feature-extraction#mfcc

CjangCjengh/MoeGoe

Executable file for VITS inference, a neural text-to-speech model for generating high-quality speech.

2.4K
Archived
Python
LLM Frameworks
Inference
Python
#text-to-speech#neural-network#inference

explosion/spacy-course

A free online course teaching advanced NLP with spaCy

2.4K
Experimental
Python
Tutorials & Courses
Gatsby
#NLP#spaCy#online-course

thewh1teagle/kokoro-onnx

An open-source Python library for building text-to-speech (TTS) applications using the Kokoro engine and ONNX runtime.

2.4K
Active
Python
AI Voice & Speech
Python
#tts#text-to-speech#kokoro

mravanelli/pytorch-kaldi

A PyTorch-based project for developing state-of-the-art speech recognition systems using the Kaldi toolkit.

2.4K
Archived
Python
Speech Recognition
API Frameworks
PyTorch
#speech-recognition#deep-learning#kaldi

r9y9/wavenet_vocoder

A high-quality open-source PyTorch implementation of the WaveNet vocoder, a neural network for speech synthesis.

2.4K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-synthesis#neural-vocoder#wavenet

cogentapps/chat-with-gpt

An open-source ChatGPT app with a voice, built using TypeScript.

2.4K
Archived
TypeScript
React
#ChatGPT#AI-powered chatbot#self-hosted

NVIDIA/waveglow

NVIDIA's WaveGlow is a flow-based generative network for high-quality speech synthesis in Python.

2.3K
Archived
Python
Speech
API Frameworks
#speech-synthesis#generative-models#audio-processing

earlephilhower/ESP8266Audio

An Arduino library to play various audio formats on ESP8266 and ESP32 devices with I2S DACs or software-emulated delta-sigma DACs.

2.3K
Stable
C
API Frameworks
Arduino & Embedded
#audio#dac#esp8266

Rayhane-mamah/Tacotron-2

Tacotron-2 is a state-of-the-art text-to-speech model that vibe coders can use to build speech synthesis applications.

2.3K
Archived
Python
Speech Synthesis
API Frameworks
TensorFlow
#speech-synthesis#text-to-speech#neural-networks

sindresorhus/awesome-whisper

An awesome list for Whisper, an open-source AI-powered speech recognition system by OpenAI.

2.2K
Stable
LLM Frameworks
AI Voice & Speech
#speech-to-text#transcription#openai

lifeiteng/vall-e

A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.

2.2K
Stable
Python
LLM Frameworks
AI Voice & Speech
PyTorch
#chatgpt#in-context-learning#large-language-models

resemble-ai/resemble-enhance

An AI-powered speech denoising and enhancement library for improving audio quality.

2.2K
Archived
Python
AI Voice & Speech
Python
#denoise#speech-denoising#speech-enhancement

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#deep-learning

m1guelpf/auto-subtitle

Automatically generate and overlay subtitles for any video using Whisper, a powerful AI speech recognition model.

2.2K
Archived
Python
AI Voice & Speech
Subtitles & Captions
Python
#ffmpeg#openai-whisper#subtitle-generator

fatchord/WaveRNN

A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.

2.2K
Archived
Python
AI Voice & Speech
PyTorch
#neural-vocoder#speech-synthesis#text-to-speech

pannous/tensorflow-speech-recognition

A deep learning-based speech recognition library built on TensorFlow for developers working with AI-powered audio apps.

2.2K
Archived
Python
Speech Recognition
API Frameworks
TensorFlow
#speech-recognition#audio-processing#deep-learning
1...810...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.