Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speech

Showing 101-120 of 368 projects

MoonInTheRiver/DiffSinger

DiffSinger is a singing voice synthesis (SVS) system that uses a shallow diffusion mechanism for efficient synthesis; it also supports text-to-speech (TTS).

4.7K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#singing-synthesis #text-to-speech #diffusion-model

Picovoice/porcupine

Porcupine is an on-device wake word detection library powered by deep learning, enabling hands-free voice activation in AI apps.

4.7K
Active
Python
AI Voice & Speech
CLI Tools
Python
#speech-recognition #voice-activation #wake-word-detection

sanchit-gandhi/whisper-jax

A JAX implementation of OpenAI's Whisper model, offering up to a 70x speed-up on TPU for speech-to-text tasks.

4.7K
Archived
Jupyter Notebook
LLM Frameworks
Speech-to-Text
JAX
#speech-recognition #whisper #jax

gradio-app/fastrtc

A Python library for building real-time communication applications using AI tools like speech-to-text and text-to-speech.

4.5K
Active
JavaScript
AI Voice & Speech
Realtime
Python
#real-time #speech-to-text #text-to-speech

remsky/Kokoro-FastAPI

A Dockerized FastAPI wrapper for the Kokoro-82M text-to-speech model with CPU and GPU support.

4.5K
Active
Python
AI Voice & Speech
FastAPI
#tts-api #fastapi #onnx

huggingface/speech-to-speech

Open-source and modular AI-powered speech-to-speech translation tool built with Python.

4.5K
Experimental
Python
Speech & Voice
API Frameworks
Python
#speech-recognition #speech-synthesis #speech-translation

OptiKey/OptiKey

OptiKey is an on-screen keyboard application written in C# that provides full computer control and speech using eye tracking.

4.4K
Active
C#
React
#authentication #accessibility #eye-tracking

fixie-ai/ultravox

A fast, multimodal LLM for real-time voice applications and AI-powered speech tools.

4.4K
Stable
Python
LLM Frameworks
AI Voice & Speech
Python
#llm #speech-recognition #text-to-speech

cmusphinx/pocketsphinx

A small, lightweight speech recognition library written in C.

4.3K
Active
C
AI Voice & Speech
#speech-recognition #c #voice

Beingpax/VoiceInk

A voice-to-text app for macOS that transcribes speech to text almost instantly.

4.1K
Active
Swift
Component Libraries (Swift)
AI Voice & Speech
#macos #transcription #voice-to-text

TensorSpeech/TensorFlowTTS

A real-time state-of-the-art speech synthesis library for TensorFlow 2, supporting multiple languages.

4.0K
Archived
Python
AI Voice & Speech
TensorFlow
#speech-synthesis #text-to-speech #tts

baidu/lac

A Chinese NLP library for tokenization, part-of-speech tagging, named entity recognition, and lexical analysis.

4.0K
Archived
C++
NLP Frameworks
API Frameworks
Java
#chinese-nlp #tokenization #part-of-speech-tagging

modelscope/ClearerVoice-Studio

An open-source toolkit for speech processing, supporting enhancement, separation, and target speaker extraction.

4.0K
Stable
Python
AI Voice & Speech
PyTorch
#speech-enhancement #speech-separation #speaker-extraction

google/lyra

A very low-bitrate speech codec for compressing voice audio on bandwidth-constrained connections.

3.9K
Archived
C++
Audio & Speech
API Frameworks
C++
#audio-compression #speech-codec #low-bitrate

QwenLM/Qwen2.5-Omni

An end-to-end multimodal AI model that takes text, audio, images, and video as input and generates text and speech responses in real time.

3.9K
Experimental
Jupyter Notebook
LLM Frameworks
AI Voice & Speech
Jupyter Notebook
#multimodal #text-to-speech #speech-recognition

Rikorose/DeepFilterNet

A deep learning-based noise suppression library for audio and speech enhancement applications.

3.9K
Archived
Python
Speech & Audio
API Frameworks
PyTorch
#noise-suppression #speech-enhancement #audio-processing

collabora/WhisperLive

A nearly-live implementation of OpenAI's Whisper for real-time speech transcription and translation.

3.9K
Active
Python
LLM Frameworks
AI Voice & Speech
Python
#whisper #speech-recognition #translation

KoljaB/RealtimeTTS

A Python library for real-time text-to-speech conversion.

3.8K
Active
Python
AI Voice & Speech
#realtime #speech-synthesis #text-to-speech

stakira/OpenUtau

An open-source successor to UTAU, a platform for singing voice synthesis and audio production.

3.6K
Active
C#
AI Voice & Speech
Audio Production
C#
#singing-synthesis #vocal-synthesis #speech-synthesis

avinashkranjan/Amazing-Python-Scripts

A curated collection of Python scripts from basics to advanced, including automation tasks.

3.5K
Active
Jupyter Notebook
AI Voice & Speech
Backend Frameworks
Python
#python #scripts #automation