Explore Projects

Discover 124 open source projects

Active filters (1):

Search: text-to-speech×

Showing 81-100 of 124 projects

BytedanceSpeech/seed-tts-eval

This Python repository provides an evaluation framework for text-to-speech models, focusing on enabling vibe coder development with AI tools.

1.5K

Archived

Python

AI Voice & Speech

Testing

Python

#text-to-speech#speech-synthesis#model-evaluation

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1.4K

Archived

Python

Prompt Engineering

React

#text-to-speech#tts#voice-cloning

edwko/OuteTTS

Interface for OuteTTS models, a Python library for text-to-speech using transformer-based models.

1.4K

Experimental

Python

LLM Frameworks

AI Voice & Speech

#text-to-speech#transformers#llama

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K

Active

Python

AI Voice & Speech

CLI Tools

Python

#gpt-sovits#text-to-speech#tts

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K

Archived

Python

AI Voice & Speech

API Frameworks

#speech-recognition#natural-language-processing#ubuntu

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K

Archived

Jupyter Notebook

AI Voice & Speech

Jupyter Notebook

#text-to-speech#emotion-control#ai-voice

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition#speech-synthesis#speech-processing

lenML/Speech-AI-Forge

A Python-based project that provides a TTS API server and Gradio-based web UI for speech synthesis and voice generation.

1.4K

Active

Python

AI Voice & Speech

API Frameworks

Gradio

#text-to-speech#speech-synthesis#api-server

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K

Active

C++

AI Voice & Speech

API Frameworks

#speech-recognition#speech-synthesis#machine-translation

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K

Stable

Python

AI Voice & Speech

ComfyUI Custom Nodes

React

#text-to-speech#voice-cloning#ComfyUI

LuckyHookin/edge-TTS-record

A tool to record Microsoft Edge browser's text-to-speech (TTS) audio and output it as .wav files on Windows.

1.4K

Archived

HTML

Frontend Frameworks

CLI Tools

Vue.js

#edge#tts#audio-recording

lamm-mit/PDF2Audio

A Jupyter Notebook project that converts PDF documents to audio using AI-powered text-to-speech.

1.4K

Experimental

Jupyter Notebook

AI Voice & Speech

Jupyter Notebook

#text-to-speech#pdf#audio

R3gm/SoniTranslate

Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.

1.3K

Stable

Python

AI Voice & Speech

CMS & Content

#video-dubbing#speech-to-text#text-to-speech

kripken/speak.js

speak.js is a text-to-speech library for JavaScript that uses the eSpeak speech synthesis engine.

1.3K

Archived

C++

AI Voice & Speech

#text-to-speech#speech-synthesis#javascript

CSTR-Edinburgh/merlin

This open-source Python library is a toolkit for building speech synthesis and voice conversion systems using deep learning.

1.3K

Archived

Python

Speech Synthesis

Voice Conversion

#speech-synthesis#voice-conversion#text-to-speech

Capsize-Games/airunner

Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting

1.3K

Stable

Python

Inference

AI Image & Video

PyGame

#ai#image-generation#chatbot

MiniMax-AI/MiniMax-MCP

Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.

1.3K

Active

Python

MCP Servers

AI Image & Video

Python

#mcp#text-to-speech#image-generation

Stypox/dicio-android

An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.

1.3K

Active

Kotlin

AI Voice & Speech

Android

#assistant#voice-assistant#android

jishengpeng/WavTokenizer

A state-of-the-art discrete acoustic codec model for audio language modeling with 40/75 tokens per second.

1.3K

Experimental

Python

Speech Representation

Audio Representation

Python

#acoustic#audio-representation#codec

nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.

1.3K

Stable

Python

AI Voice & Speech

CLI Tools

Python

#tts#audiobook#epub

1 2 3 46 7

Stay in the loop

Get weekly updates on trending AI coding tools and projects.