Showing 121-140 of 368 projects
A lightweight, fast, and efficient text-to-speech library for developers who need to add voice functionality to their projects.
Sample code for the Microsoft Cognitive Services Speech SDK, which allows developers to build voice-enabled applications.
A TypeScript-powered web app that brings ML-powered speech recognition to the browser using the Whisper AI model.
An open-source Chinese NLP library providing state-of-the-art tools for word segmentation, dependency parsing, named entity recognition, and more.
A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.
A Python library for converting video files to text transcripts using AI-powered speech recognition.
A digital avatar conversational system that combines large language models with visual models for novel human-AI interaction.
Foundation Architecture for (M)LLMs, a powerful toolkit for building large language models.
Fast local neural text-to-speech engine for offline voice synthesis
A high-quality end-to-end speech interaction model for AI-powered voice applications.
A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.
Curated list of projects that promote human-centric technology and ethical digital solutions.
An open-source text-to-speech software that enables high-quality, free-to-use voice generation.
A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.
An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.
A versatile WebUI for various AI-powered text-to-speech engines, enabling vibe coders to explore and utilize cutting-edge audio generation tools.
An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.
A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model
Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools
An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.
Get weekly updates on trending AI coding tools and projects.