Showing 281-300 of 368 projects
Offline AI-powered inference engine for art, chatbots, and automated workflows, focused on privacy and self-hosting.
PORORO is a powerful Python library that provides a wide range of neural models for natural language processing tasks.
A high-quality speech analysis, manipulation and synthesis system written in C++.
A Python library that uses LLMs, computer vision, and speech recognition to analyze video content.
An open-source speech recognition library for the Espressif ESP32 microcontroller platform.
Official MiniMax server for the Model Context Protocol (MCP), exposing AI capabilities such as text-to-speech, image generation, and video generation.
A Python library for speech restoration, including tasks like declipping, denoising, and dereverberation.
Real-time photorealistic talking-head animation system built with Python and deep learning.
An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.
A speech emotion recognition library implemented in Keras with support for CNN, LSTM, SVM, and MLP models.
A state-of-the-art discrete acoustic codec model for audio language modeling that operates at 40 or 75 tokens per second.
A C++ plugin for OBS Studio that adds closed captioning functionality using Google Speech Recognition.
A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.
This repository is a guide for giving effective public speeches, not a developer tool for vibe coders.
This codebase demonstrates how to synthesize realistic 3D character animations from speech input and a static mesh.
Matcha-TTS is a fast and efficient text-to-speech (TTS) architecture using a conditional flow matching approach.
A fast local neural text-to-speech engine for Mycroft, an open-source voice assistant.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
Official implementation of HierSpeech++, a hierarchical zero-shot speech synthesis model for text-to-speech and voice conversion.