Explore Projects

Discover 368 open source projects

Active filters (1):

Search: speech×

Showing 241-260 of 368 projects

semperai/amica

Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.

1.4K

Experimental

TypeScript

AI Voice & Speech

Computer Vision

TypeScript

#ai-assistant#speech-recognition#text-to-speech

voice-cloning-app/Voice-Cloning-App

A Python/Pytorch app for easily synthesising human voices

1.4K

Archived

Python

Prompt Engineering

React

#text-to-speech#tts#voice-cloning

cmusphinx/sphinx4

A pure Java speech recognition library that can be used in various applications.

1.4K

Archived

Java

AI Voice & Speech

#speech-recognition#natural-language-processing#audio-processing

alexsosn/iOS_ML

A curated list of Machine Learning, AI, and NLP solutions for iOS development.

1.4K

Archived

ML SDKs & Wrappers

iOS

Swift

#machine-learning#artificial-intelligence#computer-vision

microsoft/SpeechT5

Unified-Modal Speech-Text Pre-Training for Spoken Language Processing

1.4K

Archived

Python

LLM Frameworks

AI Voice & Speech

Python

#speech-recognition#speech-synthesis#speech-text-pretraining

Ewenwan/Ros

An open-source robotics operating system (ROS) with support for speech recognition, semantic understanding, visual control, and Gazebo simulation.

1.4K

Archived

Makefile

Computer Vision

Realtime

#robotics#ros#computer-vision

YuanGongND/ast

Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.

1.4K

Archived

Jupyter Notebook

Computer Vision

Speech & Audio

PyTorch

#audio-classification#speech-recognition#deep-learning

edwko/OuteTTS

Interface for OuteTTS models, a Python library for text-to-speech using transformer-based models.

1.4K

Experimental

Python

LLM Frameworks

AI Voice & Speech

#text-to-speech#transformers#llama

m1guelpf/yt-whisper

Automatically generate YouTube subtitles using OpenAI's Whisper speech recognition model

1.4K

Archived

Python

LLM Wrappers & SDKs

Subtitles & Transcription

Python

#openai#whisper#subtitles

akdeb/ElatoAI

Realtime AI voice agents with state-of-the-art multimodal AI models for AI toys, companions, and devices.

1.4K

Active

TypeScript

AI Voice & Speech

Arduino & Embedded

TypeScript

#ai#voice#realtime

sc0ty/subsync

A C++ library for synchronizing subtitles with audio/video content using speech recognition.

1.4K

Archived

C++

API Frameworks

AI Voice & Speech

#speech-recognition#subtitle-synchronization#subtitle-processing

mdn/web-speech-api

Provides demos and examples for the Web Speech API, a powerful tool for adding speech recognition and synthesis to web apps.

1.4K

Archived

JavaScript

Frontend Frameworks

AI Voice & Speech

JavaScript

#speech-recognition#speech-synthesis#voice-interface

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K

Active

Python

AI Voice & Speech

CLI Tools

Python

#gpt-sovits#text-to-speech#tts

kyutai-labs/hibiki

Hibiki is a Rust library for building real-time speech translation models for AI-powered applications.

1.4K

Experimental

Rust

AI Voice & Speech

API Frameworks

#streaming#real-time#translation

s-macke/SAM

A lightweight, open-source speech synthesis library written in C for embedded devices like the Commodore 64.

1.4K

Archived

AI Voice & Speech

#speech-synthesis#c64#embedded

0nutation/SpeechGPT

SpeechGPT is a Python library for building speech-based applications using large language models.

1.4K

Archived

Python

LLM Frameworks

AI Voice & Speech

Python

#speech-recognition#language-models#voice-assistants

DragonComputer/Dragonfire

An open-source virtual assistant for Ubuntu-based Linux distributions, focused on speech recognition and natural language processing.

1.4K

Archived

Python

AI Voice & Speech

API Frameworks

#speech-recognition#natural-language-processing#ubuntu

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K

Archived

Jupyter Notebook

Speech Recognition

Machine Learning

Jupyter Notebook

#speech-recognition#emotion-detection#neural-network

royshil/obs-localvocal

An OBS plugin that enables real-time speech recognition and captioning using AI models like OpenAI Whisper.

1.4K

Active

C++

AI Voice & Speech

Realtime

#live-streaming#realtime-transcription#speech-recognition

innnky/emotional-vits

An emotion-controllable text-to-speech model for vibe coders, built on the VITS framework.

1.4K

Archived

Jupyter Notebook

AI Voice & Speech

Jupyter Notebook

#text-to-speech#emotion-control#ai-voice

1...1214...19

Stay in the loop

Get weekly updates on trending AI coding tools and projects.