Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 121-140 of 368 projects

kyutai-labs/pocket-tts

A lightweight, fast, and efficient text-to-speech library for developers who need to add voice functionality to their projects.

3.5K
Active
Python
AI Voice & Speech
Python
#text-to-speech#tts#voice

Azure-Samples/cognitive-services-speech-sdk

Sample code for the Microsoft Cognitive Services Speech SDK, which allows developers to build voice-enabled applications.

3.4K
Active
C#
AI Voice & Speech
#speech-recognition#text-to-speech#voice-enabled

xenova/whisper-web

A TypeScript-powered web app that brings ML-powered speech recognition to the browser using the Whisper AI model.

3.3K
Archived
TypeScript
AI Voice & Speech
Frontend Frameworks
React
#speech-recognition#ai-models#browser-based

hankcs/pyhanlp

An open-source Chinese NLP library providing state-of-the-art tools for word segmentation, dependency parsing, named entity recognition, and more.

3.2K
Archived
Python
NLP
Python
#chinese-nlp#word-segmentation#dependency-parsing

ahmetoner/whisper-asr-webservice

A Python-based webservice API that provides an easy-to-use interface for the OpenAI Whisper automatic speech recognition model.

3.2K
Stable
Python
AI Voice & Speech
API Clients & Testing
Flask
#speech-recognition#automatic-speech-recognition#openai-whisper

stepfun-ai/Step-Video-T2V

A Python library for converting video files to text transcripts using AI-powered speech recognition.

3.2K
Experimental
Python
AI Video & Speech
None
#speech-recognition#text-transcription#video-to-text

Kedreamix/Linly-Talker

A digital avatar conversational system that combines large language models with visual models for novel human-AI interaction.

3.2K
Experimental
Python
AI Voice & Speech
Computer Vision
Python
#ai-system#conversational-ai#multimodal

microsoft/torchscale

Foundation Architecture for (M)LLMs, a powerful toolkit for building large language models.

3.1K
Archived
Python
LLM Frameworks
API Frameworks
Python
#large-language-models#transformer#multimodal

OHF-Voice/piper1-gpl

Fast local neural text-to-speech engine for offline voice synthesis

3.1K
Active
C++
Local Inference Engines
AI Voice & Speech
C++
#text-to-speech#tts#neural

ictnlp/LLaMA-Omni

A high-quality end-to-end speech interaction model for AI-powered voice applications.

3.1K
Experimental
Python
LLM Frameworks
AI Voice & Speech
Python
#large-language-model#speech-interaction#speech-to-speech

zzw922cn/awesome-speech-recognition-speech-synthesis-papers

A comprehensive collection of research papers on automatic speech recognition, speech synthesis, and related topics.

3.1K
Archived
AI Voice & Speech
#speech-recognition#speech-synthesis#language-modeling

humanetech-community/awesome-humane-tech

Curated list of projects that promote human-centric technology and ethical digital solutions.

3.1K
Archived
Awesome Lists
Privacy Tools
#privacy#ethics#decentralization

VOICEVOX/voicevox

An open-source text-to-speech software that enables high-quality, free-to-use voice generation.

3.0K
Active
TypeScript
AI Voice & Speech
TypeScript
#text-to-speech#voice-generation#open-source

IAHispano/Applio

A high-quality voice conversion tool focused on ease of use and performance for AI-powered audio applications.

3.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#speech-to-speech#text-to-speech#voice-conversion

speaches-ai/speaches

An open-source library for converting speech to text using OpenAI's Whisper AI model, with Docker support.

3.0K
Active
Python
AI Voice & Speech
API Frameworks
Docker
#speech-to-text#whisper-ai#openai-api

rsxdalv/TTS-WebUI

A versatile WebUI for various AI-powered text-to-speech engines, enabling vibe coders to explore and utilize cutting-edge audio generation tools.

3.0K
Active
TypeScript
AI Voice & Speech
Component Libraries (React)
React
#text-to-speech#audio-generation#ai-tools

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.

3.0K
Archived
Python
LLM Frameworks
AI Voice & Speech
PyTorch
#text-to-speech#tts#audio-lm

keithito/tacotron

A TensorFlow implementation of Google's Tacotron speech synthesis with pre-trained model

3.0K
Archived
Python
Speech Synthesis
API Frameworks
TensorFlow
#speech-synthesis#tacotron#machine-learning

HeyWillow/willow

Open-source, self-hosted voice assistant alternative to Alexa/Google Home, focused on privacy and AI tools

3.0K
Experimental
C
AI Voice & Speech
AI App Builders
ESP-IDF
#alexa#google-home#speech-recognition

KevinWang676/Bark-Voice-Cloning

An open-source project for voice cloning and speech-to-text in Chinese, built using Jupyter Notebooks.

3.0K
Stable
Jupyter Notebook
AI Voice & Speech
#speech-to-text#voice-cloning#chinese-speech
1...68...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.