Explore Projects

Discover 124 open source projects

Active filters (1):
Search: text-to-speechร—
Clear all

Showing 61-80 of 124 projects

lifeiteng/vall-e

A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.

2.2K
Stable
Python
LLM Frameworks
AI Voice & Speech
PyTorch
#chatgpt#in-context-learning#large-language-models

DigitalPhonetics/IMS-Toucan

A fast and controllable text-to-speech library supporting over 7000 languages using deep learning and PyTorch.

2.2K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#speech-synthesis#deep-learning

fatchord/WaveRNN

A high-quality neural vocoder and text-to-speech (TTS) library built with PyTorch.

2.2K
Archived
Python
AI Voice & Speech
PyTorch
#neural-vocoder#speech-synthesis#text-to-speech

ming024/FastSpeech2

FastSpeech 2 implementation for high-quality end-to-end text-to-speech

2.2K
Archived
Python
Prompt Engineering
None
React
#text-to-speech#natural-language-processing#machine-learning

r9y9/deepvoice3_pytorch

A PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models.

2.0K
Archived
Python
Speech-to-Text
Speech-Synthesis
PyTorch
#speech-processing#text-to-speech#multi-speaker

cosin2077/easyVoice

An open-source text-to-speech tool supporting long-form text and multi-voice narration.

2.0K
Active
TypeScript
AI Voice & Speech
API Frameworks
TypeScript
#tts#text-to-speech#edge-tts

crow-translate/crow-translate

A lightweight, open-source translator that supports multiple translation engines and features like OCR and text-to-speech.

1.9K
Archived
C++
General Utilities
CLI Tools
#translation#ocr#text-to-speech

miaomiaosoft/PandaOCR.Pro

A multi-functional OCR tool for text recognition, translation, text-to-speech, manga translation, and more.

1.9K
Stable
Computer Vision
File Storage
#ocr#text-recognition#translation

alexpinel/Dot

A developer-focused platform for text-to-speech, RAG, and LLMs, with local-first architecture.

1.9K
Archived
JavaScript
LLM Frameworks
RAG & Vector
React
#text-to-speech#rag#llm

Kyubyong/tacotron

A TensorFlow-based end-to-end text-to-speech synthesis model for vibe coders working on AI-powered applications.

1.8K
Archived
Python
AI Voice & Speech
API Frameworks
TensorFlow
#speech-synthesis#text-to-speech#neural-network

alan-ai/alan-sdk-android

The Alan AI SDK for Android provides a conversational AI platform for building voice assistants and chatbots.

1.8K
Experimental
Prompt Engineering
React
#conversational-ai#voice-assistant#chatbots

alan-ai/alan-sdk-flutter

The Alan AI SDK for Flutter enables building conversational AI-powered apps and voice interfaces.

1.8K
Experimental
Ruby
AI Voice & Speech
Component Libraries (Flutter)
Flutter
#conversational-ai#voice-assistant#speech-recognition

RHVoice/RHVoice

A free and open-source speech synthesizer for Russian and other languages, supporting various platforms.

1.8K
Active
C++
AI Voice & Speech
API Frameworks
#speech-synthesis#text-to-speech#russian

sxzxs/Real-time-translation-typing

Real-time typing translation software with voice-to-text and text-to-speech capabilities for League of Legends players.

1.8K
Experimental
AutoHotkey
Translators
Validation
#translation#voice-to-text#text-to-speech

ThioJoe/Auto-Synced-Translated-Dubs

Automatically translate and dub videos using AI-powered text-to-speech and subtitle synchronization.

1.7K
Active
Python
AI Voice & Speech
Text-to-Speech
Python
#ai#dubbing#subtitles

alan-ai/alan-sdk-ionic

A self-coding system for Ionic apps using AI-powered chatbot and voice assistant SDK.

1.7K
Experimental
TypeScript
React
#ionic#chatbot#conversational-ai

neural-maze/ava-whatsapp-agent-course

A Python-based agent that uses speech recognition and text-to-speech to enable conversational interactions via WhatsApp.

1.6K
Stable
Python
Agents & Orchestration
AI Voice & Speech
#agent#stt#tts

kan-bayashi/ParallelWaveGAN

Unofficial Parallel WaveGAN repository with PyTorch implementation of various neural vocoders for speech synthesis.

1.6K
Archived
Jupyter Notebook
AI Voice & Speech
Backend Frameworks
PyTorch
#speech-synthesis#neural-vocoder#parallel-wavenet

FluidInference/FluidAudio

Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.

1.6K
Active
Swift
AI Voice & Speech
iOS
Swift
#text-to-speech#speech-to-text#voice-activity-detection

Marak/say.js

A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.

1.5K
Archived
JavaScript
API Frameworks
#tts#text-to-speech#audio

Stay in the loop

Get weekly updates on trending AI coding tools and projects.