Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech

Showing 101-120 of 368 projects

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis system using a shallow diffusion mechanism, enabling efficient TTS and SVS.

4.7K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#singing-synthesis #text-to-speech #diffusion-model

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K

Active

Python

AI Voice & Speech

CLI Tools

Python

#speech-recognition #voice-activation #wake-word-detection

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model, offering up to a 70x speed-up on TPU for speech-to-text tasks.

4.7K

Archived

Jupyter Notebook

LLM Frameworks

Speech-to-Text

JAX

#speech-recognition #whisper #jax

gradio-app/fastrtc

A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.

4.5K

Active

JavaScript

AI Voice & Speech

Realtime

Python

#real-time #speech-to-text #text-to-speech

remsky/Kokoro-FastAPI

A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU and GPU support.

4.5K

Active

Python

AI Voice & Speech

FastAPI

#tts-api #fastapi #onnx

huggingface/speech-to-speech

Open-source and modular AI-powered speech-to-speech translation tool built with Python.

4.5K

Experimental

Python

Speech & Voice

API Frameworks

Python

#speech-recognition #speech-synthesis #speech-translation

OptiKey/OptiKey

OptiKey is an assistive on-screen keyboard, written in C#, for full computer control and speech with your eyes.

4.4K

Active

React

#authentication #accessibility #eye-tracking

fixie-ai/ultravox

A fast, multimodal LLM for real-time voice applications and AI-powered speech tools.

4.4K

Stable

Python

LLM Frameworks

AI Voice & Speech

Python

#llm #speech-recognition #text-to-speech

cmusphinx/pocketsphinx

A small speech recognition library written in C that can be used in a variety of applications.

4.3K

Active

AI Voice & Speech

#speech-recognition #c #voice

Beingpax/VoiceInk

A voice-to-text app for macOS that transcribes speech to text almost instantly.

4.1K

Active

Swift

Component Libraries (Swift)

AI Voice & Speech

#macos #transcription #voice-to-text

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K

Archived

Python

AI Voice & Speech

TensorFlow

#speech-synthesis #text-to-speech #tts

baidu/lac

A Chinese NLP library for tokenization, part-of-speech tagging, named entity recognition, and lexical analysis.

4.0K

Archived

C++

NLP Frameworks

API Frameworks

Java

#chinese-nlp #tokenization #part-of-speech-tagging

modelscope/ClearerVoice-Studio

An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.

4.0K

Stable

Python

AI Voice & Speech

PyTorch

#speech-enhancement #speech-separation #speaker-extraction

google/lyra

A very low-bitrate speech codec for efficient audio compression.

3.9K

Archived

C++

Audio & Speech

API Frameworks

C++

#audio-compression #speech-codec #low-bitrate

QwenLM/Qwen2.5-Omni

An end-to-end multimodal AI model that can understand and generate text, audio, vision, and video in real-time.

3.9K

Experimental

Jupyter Notebook

LLM Frameworks

AI Voice & Speech

Jupyter Notebook

#multimodal #text-to-speech #speech-recognition

Rikorose/DeepFilterNet

A deep learning-based noise suppression library for audio and speech enhancement applications.

3.9K

Archived

Python

Speech & Audio

API Frameworks

PyTorch

#noise-suppression #speech-enhancement #audio-processing

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper, a powerful speech recognition and translation tool.

3.9K

Active

Python

LLM Frameworks

AI Voice & Speech

Python

#whisper #speech-recognition #translation

KoljaB/RealtimeTTS

A Python library that provides real-time text-to-speech conversion capabilities for developers.

3.8K

Active

Python

AI Voice & Speech

#realtime #speech-synthesis #text-to-speech

stakira/OpenUtau

An open-source successor to UTAU, a platform for singing voice synthesis and audio production.

3.6K

Active

AI Voice & Speech

Audio Production

#singing-synthesis #vocal-synthesis #speech-synthesis

avinashkranjan/Amazing-Python-Scripts

A curated collection of Python scripts from basics to advanced, including automation tasks.

3.5K

Active

Jupyter Notebook

AI Voice & Speech

Backend Frameworks

Python

#python #scripts #automation
