Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech

Showing 61-80 of 368 projects

xorbitsai/inference

Unified, production-ready inference API to run open-source language, speech, and multimodal models on cloud, on-prem, or your laptop.

9.1K

Active

Python

LLM Frameworks

Inference

PyTorch

#artificial-intelligence #llm #inference

QwenLM/Qwen3-TTS

Qwen3-TTS is an open-source series of TTS models that enable stable, expressive, and streaming speech generation.

9.0K

Active

Python

AI Voice & Speech

AI App Builders

Python

#tts #text-to-speech #speech-generation

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K

Active

Python

AI Voice & Speech

Python

#speech-recognition #speech-to-text #audio

jianchang512/clone-voice

A voice cloning tool with a web interface that lets you record audio using your own voice or any other sound.

8.9K

Stable

Python

AI Voice & Speech

Frontend Frameworks

React

#clonevoice #speech-analysis #tts

Vaibhavs10/insanely-fast-whisper

An open-source library for running the Whisper AI speech recognition model efficiently on a variety of platforms.

8.8K

Stable

Jupyter Notebook

LLM Wrappers & SDKs

API Frameworks

React

#speech-recognition #whisper #llm

modelscope/modelscope

ModelScope is an open-source AI framework that brings the notion of Model-as-a-Service to life, providing a comprehensive suite of tools for building, deploying, and managing AI models.

8.8K

Active

Python

LLM Frameworks

Computer Vision

Python

#machine-learning #deep-learning #computer-vision

fishaudio/Bert-VITS2

Bert-VITS2 is a Python library that implements the VITS2 backbone with multilingual-BERT for speech synthesis and text-to-speech applications.

8.7K

Active

Python

LLM Frameworks

React

#LLM #TTS #VITS2

jasonppy/VoiceCraft

A Jupyter Notebook project for zero-shot speech editing and text-to-speech using AI models.

8.5K

Experimental

Jupyter Notebook

AI Voice & Speech

Notebooks

#zero-shot #speech-editing #text-to-speech

netease-youdao/EmotiVoice

EmotiVoice is a multi-voice and prompt-controlled TTS engine built with PyTorch for developers working with AI voice tools.

8.5K

Archived

Python

AI Voice & Speech

Machine Learning Ops

PyTorch

#text-to-speech #multi-speaker #emotion

snakers4/silero-vad

Silero VAD is a pre-trained enterprise-grade Voice Activity Detector library for Python.

8.4K

Stable

Python

AI Voice & Speech

PyTorch

#speech-processing #voice-activity-detection #voice-commands

nl8590687/ASRT_SpeechRecognition

A deep learning-based Chinese speech recognition system for developers working on AI-powered speech applications.

8.4K

Stable

Python

AI Voice & Speech

API Frameworks

TensorFlow

#speech-recognition #speech-to-text #chinese

Plachtaa/VALL-E-X

An open-source implementation of Microsoft's VALL-E X zero-shot text-to-speech model, enabling voice cloning and emotional speech synthesis.

8.0K

Archived

Python

LLM Wrappers & SDKs

AI Voice & Speech

Python

#emotional-speech #text-to-speech #transformer-architecture

jaywalnut310/vits

A PyTorch-based text-to-speech model that generates high-quality speech with expressive prosody.

7.8K

Archived

Python

AI Voice & Speech

API Frameworks

PyTorch

#text-to-speech #deep-learning #speech-synthesis

BasedHardware/omi

AI-powered wearable device that transcribes speech and summarizes conversations for developers.

7.8K

Active

Dart

AI Voice & Speech

Cross-Platform

Flutter

#ai #transcription #wearable

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-recognition #speech-emotion-recognition #audio-event-classification

smacke/ffsubsync

Automagically synchronize subtitles with video using audio alignment and speech detection.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

#audio-alignment #speech-detection #subtitle-synchronization

jianchang512/ChatTTS-ui

A simple native web interface for ChatTTS text-to-speech synthesis with API support.

7.5K

Stable

Python

AI Code Editors

React

#ChatTTS #tts #AI Coding Tools

soimort/translate-shell

A command-line translator using popular translation services like Google Translate and Bing Translator.

7.4K

Archived

Awk

API Clients & Testing

CLI Tools

#translation #command-line #google-translate

myshell-ai/MeloTTS

High-quality multilingual text-to-speech library supporting English, Spanish, French, Chinese, Japanese, and Korean.

7.2K

Archived

Python

AI Voice & Speech

Backend Frameworks

Python

#text-to-speech #multilingual #audio-generation

Zyphra/Zonos

Zonos is an open-source, high-quality text-to-speech model for developers building AI-powered applications.

7.2K

Experimental

Python

AI Voice & Speech

BaaS Platforms

Python

#text-to-speech #multilingual #open-source
