Showing 181-200 of 228 projects
Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting
A TypeScript extension for VS Code that enables voice-driven coding, reducing the need for cursor and keyboard.
Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.
An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.
A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.
An SDK for commercial device makers to integrate Alexa directly into connected products.
A fast local neural text-to-speech engine for Mycroft, an open-source voice assistant.
This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.
The official ElevenLabs MCP server, a Python-based server for the ElevenLabs AI-powered voice synthesis platform.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
A curated collection of resources for ChatTTS, including free demos, voice samples, and more for vibe coders.
An all-in-one web UI for different audio-related neural networks, including text-to-speech, voice cloning, and generative music.
A remote voice satellite using the Wyoming protocol, built with Python.
Easily manage your preferred AI digital assistants on Android with this open-source Android app.
A Linux distribution for voice-enabled IoT that embraces web standards and JavaScript/Node.js.
Real-time voice interactive digital human with customizable appearance and voice, supporting voice cloning.
LPCNet is an efficient neural speech synthesis library for developers building voice-based applications.
A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.
An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.
Get weekly updates on trending AI coding tools and projects.