Explore Projects

Discover 368 open source projects

Active filters (1):
Search: speechร—
Clear all

Showing 241-260 of 368 projects

semperai/amica

Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.

1.4K
Experimental
TypeScript
AI Voice & Speech
Computer Vision
TypeScript
#ai-assistant#speech-recognition#text-to-speech

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1.4K
Archived
Python
Prompt Engineering
React
#text-to-speech#tts#voice-cloning

cmusphinx/sphinx4

A pure Java speech recognition library that can be used in various applications.

1.4K
Archived
Java
AI Voice & Speech
#speech-recognition#natural-language-processing#audio-processing

alexsosn/iOS_ML

A curated list of Machine Learning, AI, and NLP solutions for iOS development.

1.4K
Archived
ML SDKs & Wrappers
iOS
Swift
#machine-learning#artificial-intelligence#computer-vision

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

1.4K
Archived
Python
LLM Frameworks
AI Voice & Speech
Python
#speech-recognition#speech-synthesis#speech-text-pretraining

Ewenwan/Ros

An open-source robotics operating system (ROS) with support for speech recognition, semantic understanding, visual control, and Gazebo simulation.

1.4K
Archived
Makefile
Computer Vision
Realtime
#robotics#ros#computer-vision

YuanGongND/ast

Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.

1.4K
Archived
Jupyter Notebook
Computer Vision
Speech & Audio
PyTorch
#audio-classification#speech-recognition#deep-learning

edwko/OuteTTS

Interface for OuteTTS models, a Python library for text-to-speech using transformer-based models.

1.4K
Experimental
Python
LLM Frameworks
AI Voice & Speech
#text-to-speech#transformers#llama

m1guelpf/yt-whisper

Automatically generate YouTube subtitles using OpenAI's Whisper speech recognition model

1.4K
Archived
Python
LLM Wrappers & SDKs
Subtitles & Transcription
Python
#openai#whisper#subtitles

akdeb/ElatoAI

Realtime AI voice agents with state-of-the-art multimodal AI models for AI toys, companions, and devices.

1.4K
Active
TypeScript
AI Voice & Speech
Arduino & Embedded
TypeScript
#ai#voice#realtime

sc0ty/subsync

A C++ library for synchronizing subtitles with audio/video content using speech recognition.

1.4K
Archived
C++
API Frameworks
AI Voice & Speech
#speech-recognition#subtitle-synchronization#subtitle-processing

mdn/web-speech-api

Provides demos and examples for the Web Speech API, a powerful tool for adding speech recognition and synthesis to web apps.

1.4K
Archived
JavaScript
Frontend Frameworks
AI Voice & Speech
JavaScript
#speech-recognition#speech-synthesis#voice-interface

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K
Active
Python
AI Voice & Speech
CLI Tools
Python
#gpt-sovits#text-to-speech#tts

kyutai-labs/hibiki

Hibiki is a Rust library for building real-time speech translation models for AI-powered applications.

1.4K
Experimental
Rust
AI Voice & Speech
API Frameworks
#streaming#real-time#translation

s-macke/SAM

A lightweight, open-source speech synthesis library written in C for embedded devices like the Commodore 64.

1.4K
Archived
C
AI Voice & Speech
#speech-synthesis#c64#embedded

0nutation/SpeechGPT

SpeechGPT is a Python library for building speech-based applications using large language models.

1.4K
Archived
Python
LLM Frameworks
AI Voice & Speech
Python
#speech-recognition#language-models#voice-assistants

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K
Archived
Python
AI Voice & Speech
API Frameworks
#speech-recognition#natural-language-processing#ubuntu

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K
Archived
Jupyter Notebook
Speech Recognition
Machine Learning
Jupyter Notebook
#speech-recognition#emotion-detection#neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K
Active
C++
AI Voice & Speech
Realtime
#live-streaming#realtime-transcription#speech-recognition

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K
Archived
Jupyter Notebook
AI Voice & Speech
Jupyter Notebook
#text-to-speech#emotion-control#ai-voice
1...1214...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.