Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech×

Clear all

Showing 41-60 of 368 projects

giscus/giscus

A commenting system powered by GitHub Discussions, allowing developers to add comments to their projects.

11.3K

Experimental

TypeScript

Component Libraries (React)

Next.js

#comments#github-discussions#react

speechbrain/speechbrain

A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.

11.3K

Active

Python

AI SDKs & Wrappers

PyTorch

#speech-recognition#audio-processing#deep-learning

KittenML/KittenTTS

A lightweight, state-of-the-art text-to-speech (TTS) model for developers building AI-powered applications.

11.2K

Stable

Python

AI Voice & Speech

None

#tts#speech-synthesis#ai-audio

SparkAudio/Spark-TTS

Spark-TTS is an open-source Python library for high-quality text-to-speech inference.

10.9K

Experimental

Python

AI Voice & Speech

#text-to-speech#inference#open-source

rhasspy/piper

A fast, local neural text-to-speech system for developers building voice-enabled applications.

10.6K

Stable

C++

AI Voice & Speech

#text-to-speech#tts#speech-synthesis

k2-fsa/sherpa-onnx

An offline-capable speech processing library for embedded systems, supporting a wide range of languages and platforms.

10.6K

Active

C++

AI Voice & Speech

#speech-to-text#text-to-speech#offline

AIGC-Audio/AudioGPT

AudioGPT is a powerful tool for understanding and generating speech, music, sound, and talking heads using AI.

10.2K

Archived

Python

AI Voice & Speech

Python

#audio#speech#music

Const-me/Whisper

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.2K

Archived

C++

Inference

#speech-recognition#whisper#asr

Zackriya-Solutions/meetily

A privacy-focused, open-source AI meeting assistant with faster transcription, speaker diarization, and summarization, built on Rust.

10.2K

Active

Rust

LLM Frameworks

#ai-meeting-assistant#transcription#speaker-diarization

rany2/edge-tts

A Python library that allows developers to use Microsoft Edge's online text-to-speech service without requiring Edge or an API key.

10.2K

Stable

Python

AI Voice & Speech

#text-to-speech#speech-synthesis#microsoft-edge

mozilla/TTS

A deep learning library for text-to-speech applications, focusing on generating high-quality speech from text.

10.1K

Archived

Jupyter Notebook

AI Voice & Speech

PyTorch

#text-to-speech#speech-generation#deep-learning

A robust, efficient, and low-latency speech-to-text library with advanced voice activity detection and wake word activation.

9.5K

Experimental

Python

AI Voice & Speech

#realtime#speech-to-text#voice-activity-detection

sloria/TextBlob

TextBlob is a simple, Pythonic library for natural language processing tasks like sentiment analysis, part-of-speech tagging, and more.

9.5K

Active

Python

Natural Language Processing

#nlp#sentiment-analysis#part-of-speech-tagging

voicepaw/so-vits-svc-fork

A fork of the so-vits-svc project with realtime support, improved interface, and more features for AI-powered voice conversion.

9.3K

Active

Python

AI Voice & Speech

PyTorch

#voice-conversion#speech-synthesis#realtime

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition#speaker-diarization#audio-processing

1 24...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.