Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 61-80 of 368 projects

xorbitsai/inference

Unified, production-ready inference API to run open-source, speech, and multimodal models on cloud, on-prem, or your laptop.

9.1K
Active
Python
LLM Frameworks
Inference
PyTorch
#artificial-intelligence#llm#inference

QwenLM/Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models that enable stable, expressive, and streaming speech generation.

9.0K
Active
Python
AI Voice & Speech
AI App Builders
Python
#tts#text-to-speech#speech-generation

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K
Active
Python
AI Voice & Speech
Python
#speech-recognition#speech-to-text#audio

jianchang512/clone-voice

A sound cloning tool that lets you use your voice or any sound to record audio, with a web interface.

8.9K
Stable
Python
AI Voice & Speech
Frontend Frameworks
React
#clonevoice#speech-analysis#tts

Vaibhavs10/insanely-fast-whisper

An open-source library for running the Whisper AI speech recognition model efficiently on a variety of platforms.

8.8K
Stable
Jupyter Notebook
LLM Wrappers & SDKs
API Frameworks
React
#speech-recognition#whisper#llm

modelscope/modelscope

ModelScope is an open-source AI framework that brings the notion of Model-as-a-Service to life, providing a comprehensive suite of tools for building, deploying, and managing AI models.

8.8K
Active
Python
LLM Frameworks
Computer Vision
Python
#machine-learning#deep-learning#computer-vision

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K
Active
Python
LLM Frameworks
React
#LLM#TTS#VITS2

jasonppy/VoiceCraft

A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.

8.5K
Experimental
Jupyter Notebook
AI Voice & Speech
Notebooks
#zero-shot#speech-editing#text-to-speech

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K
Archived
Python
AI Voice & Speech
Machine Learning Ops
PyTorch
#text-to-speech#multi-speaker#emotion

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K
Stable
Python
AI Voice & Speech
PyTorch
#speech-processing#voice-activity-detection#voice-commands

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K
Stable
Python
AI Voice & Speech
API Frameworks
TensorFlow
#speech-recognition#speech-to-text#chinese

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K
Archived
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#emotional-speech#text-to-speech#transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K
Archived
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#deep-learning#speech-synthesis

BasedHardware/omi

AI-powered wearable device that transcribes speech and summarizes conversations for developers.

7.8K
Active
Dart
AI Voice & Speech
Cross-Platform
Flutter
#ai#transcription#wearable

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-recognition#speech-emotion-recognition#audio-event-classification

smacke/ffsubsync

Automagically synchronize subtitles with video using audio alignment and speech detection.

7.6K
Stable
Python
AI Audio & Speech
API Frameworks
#audio-alignment#speech-detection#subtitle-synchronization

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K
Stable
Python
AI Code Editors
React
#ChatTTS#tts#AI Coding Tools

soimort/translate-shell

A command-line translator using popular translation services like Google Translate and Bing Translator.

7.4K
Archived
Awk
API Clients & Testing
CLI Tools
#translation#command-line#google-translate

myshell-ai/MeloTTS

High-quality multi-lingual text-to-speech library supporting English, Spanish, French, Chinese, Japanese and Korean.

7.2K
Archived
Python
AI Voice & Speech
Backend Frameworks
Python
#text-to-speech#multilingual#audio-generation

Zyphra/Zonos

Zonos is an open-source, high-quality text-to-speech model for developers building AI-powered applications.

7.2K
Experimental
Python
AI Voice & Speech
BaaS Platforms
Python
#text-to-speech#multilingual#open-source
1...35...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.