Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech

Showing 121-140 of 368 projects

kyutai-labs/pocket-tts

A lightweight, fast, and efficient text-to-speech library for developers who need to add voice functionality to their projects.

3.5K

Active

Python

AI Voice & Speech

Python

#text-to-speech #tts #voice

Azure-Samples/cognitive-services-speech-sdk

Sample code for the Microsoft Cognitive Services Speech SDK, which allows developers to build voice-enabled applications.

3.4K

Active

AI Voice & Speech

#speech-recognition #text-to-speech #voice-enabled

xenova/whisper-web

A TypeScript web app that runs ML-based speech recognition in the browser using the Whisper model.

3.3K

Archived

TypeScript

AI Voice & Speech

Frontend Frameworks

React

#speech-recognition #ai-models #browser-based

hankcs/pyhanlp

An open-source Chinese NLP library providing state-of-the-art tools for word segmentation, dependency parsing, named entity recognition, and more.

3.2K

Archived

Python

NLP

Python

#chinese-nlp #word-segmentation #dependency-parsing

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K

Stable

Python

AI Voice & Speech

API Clients & Testing

Flask

#speech-recognition #automatic-speech-recognition #openai-whisper

stepfun-ai/Step-Video-T2V

A text-to-video model that generates video clips from text prompts.

3.2K

Experimental

Python

AI Video & Speech

#speech-recognition #text-transcription #video-to-text

Kedreamix/Linly-Talker

A digital avatar conversational system that combines large language models with visual models for novel human-AI interaction.

3.2K

Experimental

Python

AI Voice & Speech

Computer Vision

Python

#ai-system #conversational-ai #multimodal

microsoft/torchscale

Foundation Architecture for (M)LLMs, a powerful toolkit for building large language models.

3.1K

Archived

Python

LLM Frameworks

API Frameworks

Python

#large-language-models #transformer #multimodal

OHF-Voice/piper1-gpl

Fast local neural text-to-speech engine for offline voice synthesis

3.1K

Active

C++

Local Inference Engines

AI Voice & Speech

C++

#text-to-speech #tts #neural

ictnlp/LLaMA-Omni

A high-quality end-to-end speech interaction model for AI-powered voice applications.

3.1K

Experimental

Python

LLM Frameworks

AI Voice & Speech

Python

#large-language-model #speech-interaction #speech-to-speech

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K

Archived

AI Voice & Speech

#speech-recognition #speech-synthesis #language-modeling

humanetech-community/awesome-humane-tech

Curated list of projects that promote human-centric technology and ethical digital solutions.

3.1K

Archived

Awesome Lists

Privacy Tools

#privacy #ethics #decentralization

VOICEVOX/voicevox

Open-source text-to-speech software that enables high-quality, free-to-use voice generation.

3.0K

Active

TypeScript

AI Voice & Speech

TypeScript

#text-to-speech #voice-generation #open-source

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-to-speech #text-to-speech #voice-conversion

speaches-ai/speaches

An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.

3.0K

Active

Python

AI Voice & Speech

API Frameworks

Docker

#speech-to-text #whisper-ai #openai-api

rsxdalv/TTS-WebUI

A WebUI that brings together various AI-powered text-to-speech engines for exploring and using audio generation tools.

3.0K

Active

TypeScript

AI Voice & Speech

Component Libraries (React)

React

#text-to-speech #audio-generation #ai-tools

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.

3.0K

Archived

Python

LLM Frameworks

AI Voice & Speech

PyTorch

#text-to-speech #tts #audio-lm

keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis, with a pre-trained model.

3.0K

Archived

Python

Speech Synthesis

API Frameworks

TensorFlow

#speech-synthesis #tacotron #machine-learning

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K

Experimental

AI Voice & Speech

AI App Builders

ESP-IDF

#alexa #google-home #speech-recognition

KevinWang676/Bark-Voice-Cloning

An open-source project for voice cloning and text-to-speech with Chinese support, built using Jupyter Notebooks.

3.0K

Stable

Jupyter Notebook

AI Voice & Speech

#speech-to-text #voice-cloning #chinese-speech
