Explore Projects

Discover 124 open source projects

Active filters (1):

Search: text-to-speech×

Clear all

Showing 61-80 of 124 projects

lifeiteng/vall-e

A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.

2.2K

Stable

Python

LLM Frameworks

AI Voice & Speech

PyTorch

#chatgpt#in-context-learning#large-language-models

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech#speech-synthesis#deep-learning

fatchord/WaveRNN

A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.

2.2K

Archived

Python

AI Voice & Speech

PyTorch

#neural-vocoder#speech-synthesis#text-to-speech

ming024/FastSpeech2

FastSpeech 2 implementation for high-quality end-to-end text-to-speech

2.2K

Archived

Python

Prompt Engineering

None

React

#text-to-speech#natural-language-processing#machine-learning

r9y9/deepvoice3_pytorch

A PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models.

2.0K

Archived

Python

Speech-to-Text

Speech-Synthesis

PyTorch

#speech-processing#text-to-speech#multi-speaker

cosin2077/easyVoice

An open-source text-to-speech tool supporting long-form text and multi-voice narration.

2.0K

Active

TypeScript

AI Voice & Speech

API Frameworks

TypeScript

#tts#text-to-speech#edge-tts

crow-translate/crow-translate

A lightweight, open-source translator that supports multiple translation engines and features like OCR and text-to-speech.

1.9K

Archived

C++

General Utilities

CLI Tools

#translation#ocr#text-to-speech

miaomiaosoft/PandaOCR.Pro

A multi-functional OCR tool for text recognition, translation, text-to-speech, manga translation, and more.

1.9K

Stable

Computer Vision

File Storage

#ocr#text-recognition#translation

alexpinel/Dot

A developer-focused platform for text-to-speech, RAG, and LLMs, with local-first architecture.

1.9K

Archived

JavaScript

LLM Frameworks

RAG & Vector

React

#text-to-speech#rag#llm

Kyubyong/tacotron

A TensorFlow-based end-to-end text-to-speech synthesis model for vibe coders working on AI-powered applications.

1.8K

Archived

Python

AI Voice & Speech

API Frameworks

TensorFlow

#speech-synthesis#text-to-speech#neural-network

alan-ai/alan-sdk-android

The Alan AI SDK for Android provides a conversational AI platform for building voice assistants and chatbots.

1.8K

Experimental

Prompt Engineering

React

#conversational-ai#voice-assistant#chatbots

alan-ai/alan-sdk-flutter

The Alan AI SDK for Flutter enables building conversational AI-powered apps and voice interfaces.

1.8K

Experimental

Ruby

AI Voice & Speech

Component Libraries (Flutter)

Flutter

#conversational-ai#voice-assistant#speech-recognition

RHVoice/RHVoice

A free and open-source speech synthesizer for Russian and other languages, supporting various platforms.

1.8K

Active

C++

AI Voice & Speech

API Frameworks

#speech-synthesis#text-to-speech#russian

sxzxs/Real-time-translation-typing

Real-time typing translation software with voice-to-text and text-to-speech capabilities for League of Legends players.

1.8K

Experimental

AutoHotkey

Translators

Validation

#translation#voice-to-text#text-to-speech

ThioJoe/Auto-Synced-Translated-Dubs

Automatically translate and dub videos using AI-powered text-to-speech and subtitle synchronization.

1.7K

Active

Python

AI Voice & Speech

Text-to-Speech

Python

#ai#dubbing#subtitles

alan-ai/alan-sdk-ionic

A self-coding system for Ionic apps using AI-powered chatbot and voice assistant SDK.

1.7K

Experimental

TypeScript

React

#ionic#chatbot#conversational-ai

neural-maze/ava-whatsapp-agent-course

A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.

1.6K

Stable

Python

Agents & Orchestration

AI Voice & Speech

#agent#stt#tts

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN repository with PyTorch implementation of various neural vocoders for speech synthesis.

1.6K

Archived

Jupyter Notebook

AI Voice & Speech

Backend Frameworks

PyTorch

#speech-synthesis#neural-vocoder#parallel-wavenet

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K

Active

Swift

AI Voice & Speech

iOS

Swift

#text-to-speech#speech-to-text#voice-activity-detection

Marak/say.js

A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.

1.5K

Archived

JavaScript

API Frameworks

#tts#text-to-speech#audio

1 2 35 6 7

Stay in the loop

Get weekly updates on trending AI coding tools and projects.