Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 281-300 of 368 projects

Capsize-Games/airunner

Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting

1.3K
Stable
Python
Inference
AI Image & Video
PyGame
#ai#image-generation#chatbot

kakaobrain/pororo

PORORO is a powerful Python library that provides a wide range of neural models for natural language processing tasks.

1.3K
Archived
Python
LLM Frameworks
AI Voice & Speech
Python
#natural-language-processing#speech-recognition#speech-synthesis

mmorise/World

A high-quality speech analysis, manipulation and synthesis system written in C++.

1.3K
Experimental
C++
AI Voice & Speech
#speech-analysis#speech-synthesis#vocoder

byjlw/video-analyzer

A Python library that uses LLMs, computer vision, and speech recognition to analyze video content.

1.3K
Experimental
Python
Computer Vision
LLM Frameworks
#video-processing#llms#asr

espressif/esp-sr

An open-source speech recognition library for the Espressif ESP32 microcontroller platform.

1.3K
Active
C
AI Voice & Speech
API Frameworks
#speech-recognition#esp32#microcontroller

MiniMax-AI/MiniMax-MCP

Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.

1.3K
Active
Python
MCP Servers
AI Image & Video
Python
#mcp#text-to-speech#image-generation

haoheliu/voicefixer

A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.

1.3K
Experimental
Python
AI Voice & Speech
Signal Processing
Python
#speech-enhancement#audio-processing#signal-processing

YuanxunLu/LiveSpeechPortraits

Real-time photorealistic talking-head animation system built with Python and deep learning.

1.3K
Archived
Python
Computer Vision
AI Voice & Speech
React
#computer-vision#talking-head#speech-animation

Stypox/dicio-android

An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.

1.3K
Active
Kotlin
AI Voice & Speech
Android
#assistant#voice-assistant#android

Renovamen/Speech-Emotion-Recognition

A speech emotion recognition library implemented in Keras with support for CNN, LSTM, SVM, and MLP models.

1.3K
Archived
Python
Speech & Voice
ML Ops
Keras
#speech-emotion-recognition#cnn#lstm

jishengpeng/WavTokenizer

A state-of-the-art discrete acoustic codec model for audio language modeling with 40/75 tokens per second.

1.3K
Experimental
Python
Speech Representation
Audio Representation
Python
#acoustic#audio-representation#codec

ratwithacompiler/OBS-captions-plugin

A C++ plugin for OBS Studio that adds closed captioning functionality using Google Speech Recognition.

1.3K
Stable
C++
API Frameworks
AI Voice & Speech
#closed-captioning#speech-recognition#obs-studio

sdkcarlos/artyom.js

A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.

1.3K
Archived
JavaScript
AI Voice & Speech
Frontend Frameworks
JavaScript
#speech-recognition#speech-synthesis#voice-commands

nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.

1.3K
Stable
Python
AI Voice & Speech
CLI Tools
Python
#tts#audiobook#epub

xiaolai/public-speaking-with-meaning

This repository is a guide for giving effective public speeches, not a developer tool for vibe coders.

1.3K
Archived
Python
Tutorials & Courses
Books & Guides
#public-speaking#presentation-skills#communication-skills

TimoBolkart/voca

This codebase demonstrates how to synthesize realistic 3D character animations from speech input and a static mesh.

1.3K
Archived
Python
Computer Vision
API Frameworks
Python
#3d-animation#speech-to-animation#3d-face

shivammehta25/Matcha-TTS

Matcha-TTS is a fast and efficient text-to-speech (TTS) architecture using a conditional flow matching approach.

1.3K
Active
Jupyter Notebook
AI Voice & Speech
Diffusion Models
Jupyter Notebook
#text-to-speech#tts#diffusion-model

MycroftAI/mimic3

A fast local neural text-to-speech engine for Mycroft, an open-source voice assistant.

1.2K
Experimental
Python
AI Voice & Speech
Python
#text-to-speech#neural-network#voice-assistant

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-translation#speech-synthesis

sh-lee-prml/HierSpeechpp

Official implementation of HierSpeech++, a hierarchical speech recognition model.

1.2K
Archived
Python
AI Voice & Speech
Python
#speech-recognition#hierarchical-models#neural-networks
1...1416...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.