Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 301-320 of 368 projects

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#speech-translation#speech-synthesis

YannickJadoul/Parselmouth

Praat in Python, a Python library for speech analysis and manipulation.

1.2K
Active
C++
CLI Tools
Libraries
#speech-analysis#speech-processing#audio-manipulation

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K
Experimental
Python
AI Voice & Speech
BaaS Platforms
Python
#speech-recognition#asr#kaldi

gitmylo/audio-webui

An all-in-one web UI for different audio-related neural networks, including text-to-speech, voice cloning, and generative music.

1.2K
Experimental
Python
LLM Wrappers & SDKs
AI Voice & Speech
React
#audio#neural-networks#text-to-speech

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K
Archived
Python
Audio & Speech
Signal Processing
PyTorch
#audio-processing#speech-recognition#signal-processing

Softcatala/whisper-ctranslate2

A Python command-line client for the Whisper speech-to-text model by OpenAI, using the CTranslate2 library.

1.2K
Stable
Python
LLM Wrappers & SDKs
API Frameworks
Python
#openai-whisper#speech-recognition#speech-to-text

PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

1.2K
Archived
Python
AI Voice & Speech
API Frameworks
#tts#bert#vits

WenzheLiu-Speech/awesome-speech-enhancement

A collection of resources for speech enhancement, speech separation, and sound source localization.

1.2K
Archived
AI Voice & Speech
#speech-enhancement#speech-separation#sound-source-localization

Alexander-H-Liu/End-to-end-ASR-Pytorch

Open-source PyTorch implementation of an end-to-end automatic speech recognition (ASR) system.

1.2K
Archived
Python
Speech & Voice
API Frameworks
PyTorch
#speech-recognition#asr#pytorch

ga642381/speech-trident

A collection of high-quality open-source speech, audio, and codec models for building AI-powered speech applications.

1.2K
Stable
LLM Frameworks
AI Voice & Speech
Node
#speech-recognition#audio-processing#codec-models

xiph/LPCNet

LPCNet is an efficient neural speech synthesis library for developers building voice-based applications.

1.2K
Archived
C
AI Voice & Speech
#speech-synthesis#neural-networks#efficiency

hgneng/ekho

A Chinese text-to-speech engine supporting Cantonese, Tibetan and other languages.

1.2K
Experimental
Lex
AI Voice & Speech
API Frameworks
#text-to-speech#chinese#cantonese

ekwek1/soprano

Soprano is a Python library that provides ultra-realistic text-to-speech capabilities.

1.2K
Active
Python
AI Voice & Speech
#text-to-speech#tts#voice-synthesis

yeyupiaoling/Whisper-Finetune

Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.

1.2K
Stable
C
Fine-tuning
Inference
PyTorch
#speech-recognition#whisper#fine-tune

modal-labs/quillman

A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.

1.2K
Experimental
Python
AI Voice & Speech
Serverless
#speech-recognition#text-to-speech#serverless

OpenMOSS/MOSS-TTSD

An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.

1.2K
Stable
Python
AI Voice & Speech
API Frameworks
#speech-dialogue-generation#multi-speaker-voice-cloning#long-form-speech-generation

maum-ai/voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system for audio separation.

1.2K
Archived
Python
Audio Voice & Speech
API Frameworks
PyTorch
#audio-separation#speech-separation#source-separation

unitaryai/detoxify

Detoxify is a Python library with trained models to detect toxic comments, built using Pytorch Lightning and Transformers.

1.2K
Active
Python
Computer Vision
API Development
Pytorch Lightning
#bert#nlp#toxic-comments

TencentGameMate/chinese_speech_pretrain

This repository contains pre-trained Chinese speech models for developers working with speech AI.

1.2K
Archived
Shell
AI Voice & Speech
#speech-recognition#chinese-language#pre-trained-models

PantoMatrix/PantoMatrix

PantoMatrix is a Python library for generating facial and body animations from speech, designed for vibe coders building AI-powered projects.

1.2K
Archived
Python
AI Voice & Speech
Computer Vision
#speech-to-animation#face-generation#body-animation
1...1517...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.