Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech×

Clear all

Showing 301-320 of 368 projects

ictnlp/StreamSpeech

An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.

1.2K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition#speech-translation#speech-synthesis

YannickJadoul/Parselmouth

Praat in Python, a Python library for speech analysis and manipulation.

1.2K

Active

C++

CLI Tools

Libraries

#speech-analysis#speech-processing#audio-manipulation

alphacep/vosk-server

A speech recognition server based on Vosk and Kaldi libraries, supporting WebSocket, gRPC, and WebRTC protocols.

1.2K

Experimental

Python

AI Voice & Speech

BaaS Platforms

Python

#speech-recognition#asr#kaldi

gitmylo/audio-webui

An all-in-one web UI for different audio-related neural networks, including text-to-speech, voice cloning, and generative music.

1.2K

Experimental

Python

LLM Wrappers & SDKs

AI Voice & Speech

React

#audio#neural-networks#text-to-speech

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K

Archived

Python

Audio & Speech

Signal Processing

PyTorch

#audio-processing#speech-recognition#signal-processing

Softcatala/whisper-ctranslate2

A Python command-line client for the Whisper speech-to-text model by OpenAI, using the CTranslate2 library.

1.2K

Stable

Python

LLM Wrappers & SDKs

API Frameworks

Python

#openai-whisper#speech-recognition#speech-to-text

PlayVoice/vits_chinese

Best practice TTS based on BERT and VITS with some Natural Speech Features Of Microsoft; Support ONNX streaming out!

1.2K

Archived

Python

AI Voice & Speech

API Frameworks

#tts#bert#vits

WenzheLiu-Speech/awesome-speech-enhancement

A collection of resources for speech enhancement, speech separation, and sound source localization.

1.2K

Archived

AI Voice & Speech

#speech-enhancement#speech-separation#sound-source-localization

Alexander-H-Liu/End-to-end-ASR-Pytorch

Open-source PyTorch implementation of an end-to-end automatic speech recognition (ASR) system.

1.2K

Archived

Python

Speech & Voice

API Frameworks

PyTorch

#speech-recognition#asr#pytorch

ga642381/speech-trident

A collection of high-quality open-source speech, audio, and codec models for building AI-powered speech applications.

1.2K

Stable

LLM Frameworks

AI Voice & Speech

Node

#speech-recognition#audio-processing#codec-models

xiph/LPCNet

LPCNet is an efficient neural speech synthesis library for developers building voice-based applications.

1.2K

Archived

AI Voice & Speech

#speech-synthesis#neural-networks#efficiency

hgneng/ekho

A Chinese text-to-speech engine supporting Cantonese, Tibetan and other languages.

1.2K

Experimental

Lex

AI Voice & Speech

API Frameworks

#text-to-speech#chinese#cantonese

ekwek1/soprano

Soprano is a Python library that provides ultra-realistic text-to-speech capabilities.

1.2K

Active

Python

AI Voice & Speech

#text-to-speech#tts#voice-synthesis

yeyupiaoling/Whisper-Finetune

Fine-tune and deploy the Whisper speech recognition model with accelerated inference and support for various platforms.

1.2K

Stable

Fine-tuning

Inference

PyTorch

#speech-recognition#whisper#fine-tune

modal-labs/quillman

A voice chat app built with Python that leverages AI-powered speech recognition and text-to-speech capabilities.

1.2K

Experimental

Python

AI Voice & Speech

Serverless

#speech-recognition#text-to-speech#serverless

OpenMOSS/MOSS-TTSD

An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.

1.2K

Stable

Python

AI Voice & Speech

API Frameworks

#speech-dialogue-generation#multi-speaker-voice-cloning#long-form-speech-generation

maum-ai/voicefilter

Unofficial PyTorch implementation of Google AI's VoiceFilter system for audio separation.

1.2K

Archived

Python

Audio Voice & Speech

API Frameworks

PyTorch

#audio-separation#speech-separation#source-separation

unitaryai/detoxify

Detoxify is a Python library with trained models to detect toxic comments, built using Pytorch Lightning and Transformers.

1.2K

Active

Python

Computer Vision

API Development

Pytorch Lightning

#bert#nlp#toxic-comments

TencentGameMate/chinese_speech_pretrain

This repository contains pre-trained Chinese speech models for developers working with speech AI.

1.2K

Archived

Shell

AI Voice & Speech

#speech-recognition#chinese-language#pre-trained-models

PantoMatrix/PantoMatrix

PantoMatrix is a Python library for generating facial and body animations from speech, designed for vibe coders building AI-powered projects.

1.2K

Archived

Python

AI Voice & Speech

Computer Vision

#speech-to-animation#face-generation#body-animation

1...1517...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.