Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 41-60 of 368 projects

giscus/giscus

A commenting system powered by GitHub Discussions, allowing developers to add comments to their projects.

11.3K
Experimental
TypeScript
Component Libraries (React)
Next.js
#comments#github-discussions#react

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K
Active
Python
AI SDKs & Wrappers
PyTorch
#speech-recognition#audio-processing#deep-learning

KittenML/KittenTTS

A lightweight, state-of-the-art text-to-speech (TTS) model for developers building AI-powered applications.

11.2K
Stable
Python
AI Voice & Speech
None
#tts#speech-synthesis#ai-audio

SparkAudio/Spark-TTS

Spark-TTS is an open-source Python library for high-quality text-to-speech inference.

10.9K
Experimental
Python
AI Voice & Speech
#text-to-speech#inference#open-source

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K
Stable
C++
AI Voice & Speech
#text-to-speech#tts#speech-synthesis

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K
Active
C++
AI Voice & Speech
#speech-to-text#text-to-speech#offline

AIGC-Audio/AudioGPT

AudioGPT is a powerful tool for understanding and generating speech, music, sound, and talking heads using AI.

10.2K
Archived
Python
AI Voice & Speech
Python
#audio#speech#music

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K
Archived
C++
Inference
#speech-recognition#whisper#asr

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K
Active
Rust
LLM Frameworks
#ai-meeting-assistant#transcription#speaker-diarization

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K
Stable
Python
AI Voice & Speech
#text-to-speech#speech-synthesis#microsoft-edge

mozilla/TTS

A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.

10.1K
Archived
Jupyter Notebook
AI Voice & Speech
PyTorch
#text-to-speech#speech-generation#deep-learning

openvinotoolkit/openvino

OpenVINO is an open-source toolkit for optimizing and deploying AI inference on a variety of hardware.

9.8K
Active
C++
Inference
#ai#computer-vision#deep-learning

kyutai-labs/moshi

Moshi is an open-source speech-to-text foundation model and dialogue framework for building AI-powered voice apps.

9.8K
Active
Python
LLM Frameworks
Python
#speech-to-text#dialogue-framework#audio-codec

QuentinFuxa/WhisperLiveKit

A simultaneous speech-to-text model powered by the Whisper AI library for real-time transcription.

9.8K
Active
Python
AI Voice & Speech
Python
#speech-to-text#transcription#whisper

espnet/espnet

End-to-end speech processing toolkit for tasks like speech recognition, synthesis, translation, and more.

9.8K
Active
Python
Speech & Voice
PyTorch
#speech-recognition#speech-synthesis#speech-translation

open-mmlab/Amphion

Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.

9.7K
Experimental
Python
AI Audio & Speech
Python
#audio-generation#speech-synthesis#text-to-speech

KoljaB/RealtimeSTT

A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.

9.5K
Experimental
Python
AI Voice & Speech
#realtime#speech-to-text#voice-activity-detection

sloria/TextBlob

TextBlob is a simple, Pythonic library for natural language processing tasks like sentiment analysis, part-of-speech tagging, and more.

9.5K
Active
Python
Natural Language Processing
#nlp#sentiment-analysis#part-of-speech-tagging

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K
Active
Python
AI Voice & Speech
PyTorch
#voice-conversion#speech-synthesis#realtime

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K
Active
Jupyter Notebook
Speech Processing
API Frameworks
PyTorch
#speech-recognition#speaker-diarization#audio-processing
124...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.