Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech

Showing 281-300 of 368 projects

Capsize-Games/airunner

Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting

1.3K

Stable

Python

Inference

AI Image & Video

PyGame

#ai#image-generation#chatbot

kakaobrain/pororo

PORORO is a powerful Python library that provides a wide range of neural models for natural language processing tasks.

1.3K

Archived

Python

LLM Frameworks

AI Voice & Speech

Python

#natural-language-processing#speech-recognition#speech-synthesis

mmorise/World

A high-quality speech analysis, manipulation and synthesis system written in C++.

1.3K

Experimental

C++

AI Voice & Speech

#speech-analysis#speech-synthesis#vocoder

byjlw/video-analyzer

A Python library that uses LLMs, computer vision, and speech recognition to analyze video content.

1.3K

Experimental

Python

Computer Vision

LLM Frameworks

#video-processing#llms#asr

espressif/esp-sr

An open-source speech recognition library for the Espressif ESP32 microcontroller platform.

1.3K

Active

AI Voice & Speech

API Frameworks

#speech-recognition#esp32#microcontroller

MiniMax-AI/MiniMax-MCP

Official MiniMax Model Context Protocol (MCP) server that exposes AI capabilities such as text-to-speech, image generation, and video generation.

1.3K

Active

Python

MCP Servers

AI Image & Video

Python

#mcp#text-to-speech#image-generation

haoheliu/voicefixer

A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.

1.3K

Experimental

Python

AI Voice & Speech

Signal Processing

Python

#speech-enhancement#audio-processing#signal-processing

YuanxunLu/LiveSpeechPortraits

Real-time photorealistic talking-head animation system built with Python and deep learning.

1.3K

Archived

Python

Computer Vision

AI Voice & Speech

React

#computer-vision#talking-head#speech-animation

Stypox/dicio-android

An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.

1.3K

Active

Kotlin

AI Voice & Speech

Android

#assistant#voice-assistant#android

Renovamen/Speech-Emotion-Recognition

A speech emotion recognition library implemented in Keras with support for CNN, LSTM, SVM, and MLP models.

1.3K

Archived

Python

Speech & Voice

ML Ops

Keras

#speech-emotion-recognition#cnn#lstm

jishengpeng/WavTokenizer

A discrete acoustic codec model for audio language modeling that represents audio at 40 or 75 tokens per second.

1.3K

Experimental

Python

Speech Representation

Audio Representation

Python

#acoustic#audio-representation#codec

ratwithacompiler/OBS-captions-plugin

A C++ plugin for OBS Studio that adds closed captioning functionality using Google Speech Recognition.

1.3K

Stable

C++

API Frameworks

AI Voice & Speech

#closed-captioning#speech-recognition#obs-studio

sdkcarlos/artyom.js

A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.

1.3K

Archived

JavaScript

AI Voice & Speech

Frontend Frameworks

JavaScript

#speech-recognition#speech-synthesis#voice-commands

nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.

1.3K

Stable

Python

AI Voice & Speech

CLI Tools

Python

#tts#audiobook#epub

xiaolai/public-speaking-with-meaning

A guide to giving effective public speeches; it is not a developer tool.

1.3K

Archived

Python

Tutorials & Courses

Books & Guides

#public-speaking#presentation-skills#communication-skills

TimoBolkart/voca

A codebase for synthesizing realistic 3D facial animation from speech input and a static face mesh.

1.3K

Archived

Python

Computer Vision

API Frameworks

Python

#3d-animation#speech-to-animation#3d-face

shivammehta25/Matcha-TTS

Matcha-TTS is a fast and efficient text-to-speech (TTS) architecture using a conditional flow matching approach.

1.3K

Active

Jupyter Notebook

AI Voice & Speech

Diffusion Models

Jupyter Notebook

#text-to-speech#tts#diffusion-model

MycroftAI/mimic3

A fast local neural text-to-speech engine for Mycroft, an open-source voice assistant.

1.2K

Experimental

Python

AI Voice & Speech

Python

#text-to-speech#neural-network#voice-assistant

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#speech-translation#speech-synthesis

sh-lee-prml/HierSpeechpp

Official implementation of HierSpeech++, a hierarchical model for zero-shot speech synthesis (text-to-speech and voice conversion).

1.2K

Archived

Python

AI Voice & Speech

Python

#speech-recognition#hierarchical-models#neural-networks
