Explore Projects

Discover 124 open source projects

Active filters (1):
Search: text-to-speechร—
Clear all

Showing 81-100 of 124 projects

BytedanceSpeech/seed-tts-eval

This Python repository provides an evaluation framework for text-to-speech models, focusing on enabling vibe coder development with AI tools.

1.5K
Archived
Python
AI Voice & Speech
Testing
Python
#text-to-speech#speech-synthesis#model-evaluation

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1.4K
Archived
Python
Prompt Engineering
React
#text-to-speech#tts#voice-cloning

edwko/OuteTTS

Interface for OuteTTS models, a Python library for text-to-speech using transformer-based models.

1.4K
Experimental
Python
LLM Frameworks
AI Voice & Speech
#text-to-speech#transformers#llama

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K
Active
Python
AI Voice & Speech
CLI Tools
Python
#gpt-sovits#text-to-speech#tts

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K
Archived
Python
AI Voice & Speech
API Frameworks
#speech-recognition#natural-language-processing#ubuntu

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K
Archived
Jupyter Notebook
AI Voice & Speech
Jupyter Notebook
#text-to-speech#emotion-control#ai-voice

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

lenML/Speech-AI-Forge

A Python-based project that provides a TTS API server and Gradio-based web UI for speech synthesis and voice generation.

1.4K
Active
Python
AI Voice & Speech
API Frameworks
Gradio
#text-to-speech#speech-synthesis#api-server

mkiol/dsnote

A Linux app for speech-to-text, text-to-speech, and machine translation, with offline capabilities.

1.4K
Active
C++
AI Voice & Speech
API Frameworks
#speech-recognition#speech-synthesis#machine-translation

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K
Stable
Python
AI Voice & Speech
ComfyUI Custom Nodes
React
#text-to-speech#voice-cloning#ComfyUI

LuckyHookin/edge-TTS-record

A tool to record Microsoft Edge browser's text-to-speech (TTS) audio and output it as .wav files on Windows.

1.4K
Archived
HTML
Frontend Frameworks
CLI Tools
Vue.js
#edge#tts#audio-recording

lamm-mit/PDF2Audio

A Jupyter Notebook project that converts PDF documents to audio using AI-powered text-to-speech.

1.4K
Experimental
Jupyter Notebook
AI Voice & Speech
Jupyter Notebook
#text-to-speech#pdf#audio

R3gm/SoniTranslate

Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.

1.3K
Stable
Python
AI Voice & Speech
CMS & Content
#video-dubbing#speech-to-text#text-to-speech

kripken/speak.js

speak.js is a text-to-speech library for JavaScript that uses the eSpeak speech synthesis engine.

1.3K
Archived
C++
AI Voice & Speech
#text-to-speech#speech-synthesis#javascript

CSTR-Edinburgh/merlin

This open-source Python library is a toolkit for building speech synthesis and voice conversion systems using deep learning.

1.3K
Archived
Python
Speech Synthesis
Voice Conversion
#speech-synthesis#voice-conversion#text-to-speech

Capsize-Games/airunner

Offline AI-powered inference engine for art, chatbots, and automated workflows focused on privacy and self-hosting

1.3K
Stable
Python
Inference
AI Image & Video
PyGame
#ai#image-generation#chatbot

MiniMax-AI/MiniMax-MCP

Official server for the MiniMax Model Context Protocol (MCP) that enables powerful AI capabilities like text-to-speech, image generation, and video generation.

1.3K
Active
Python
MCP Servers
AI Image & Video
Python
#mcp#text-to-speech#image-generation

Stypox/dicio-android

An Android assistant app that uses voice recognition, text-to-speech, and AI skills to provide a personal assistant experience.

1.3K
Active
Kotlin
AI Voice & Speech
Android
#assistant#voice-assistant#android

jishengpeng/WavTokenizer

A state-of-the-art discrete acoustic codec model for audio language modeling with 40/75 tokens per second.

1.3K
Experimental
Python
Speech Representation
Audio Representation
Python
#acoustic#audio-representation#codec

nazdridoy/kokoro-tts

A CLI text-to-speech tool using the Kokoro model, supporting multiple languages and input formats.

1.3K
Stable
Python
AI Voice & Speech
CLI Tools
Python
#tts#audiobook#epub

Stay in the loop

Get weekly updates on trending AI coding tools and projects.