Explore Projects

Discover 228 open source projects

Active filters (1):
Search: voice×
Clear all

Showing 161-180 of 228 projects

akdeb/ElatoAI

Realtime AI voice agents with state-of-the-art multimodal AI models for AI toys, companions, and devices.

1.4K
Active
TypeScript
AI Voice & Speech
Arduino & Embedded
TypeScript
#ai#voice#realtime

hungtraan/FacebookBot

A Facebook Messenger Bot with voice recognition, NLP, and features like restaurant search and memo transcription.

1.4K
Archived
JavaScript
AI Voice & Speech
API Frameworks
Node
#voice-recognition#natural-language-processing#restaurant-search

High-Logic/Genie-TTS

A GPT-SoVITS ONNX Inference Engine & Model Converter to enable voice cloning and text-to-speech for developers.

1.4K
Active
Python
AI Voice & Speech
CLI Tools
Python
#gpt-sovits#text-to-speech#tts

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K
Archived
Jupyter Notebook
Speech Recognition
Machine Learning
Jupyter Notebook
#speech-recognition#emotion-detection#neural-network

TOM88812/xiaozhi-android-client

A Flutter-based Android/iOS voice chat app built on the Xiaozhi chatbot server.

1.4K
Stable
Dart
AI Voice & Speech
Cross-Platform
Flutter
#chatbot#voice-chat#ai-assistant

SociallyIneptWeeb/AICoverGen

A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.

1.4K
Experimental
Python
AI Voice & Speech
WebUI
React
#ai-audio#song-covers#webui

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

lenML/Speech-AI-Forge

A Python-based project that provides a TTS API server and Gradio-based web UI for speech synthesis and voice generation.

1.4K
Active
Python
AI Voice & Speech
API Frameworks
Gradio
#text-to-speech#speech-synthesis#api-server

twilio/twilio-ruby

A Ruby library for interacting with the Twilio API and generating TwiML for voice and SMS applications.

1.4K
Active
Ruby
API Clients & Testing
Authentication
#twilio#sms#voice

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K
Stable
Python
AI Voice & Speech
ComfyUI Custom Nodes
React
#text-to-speech#voice-cloning#ComfyUI

zenorocha/voice-elements

A web component wrapper for the Web Speech API, enabling voice recognition and speech synthesis.

1.3K
Archived
HTML
AI Voice & Speech
Polymer
#voice-recognition#speech-synthesis#web-components

dingdang-robot/dingdang-robot

An open-source Chinese voice assistant project that runs on Raspberry Pi

1.3K
Archived
Python
API Frameworks
Raspberry Pi
Python
#voice-assistant#raspberry-pi#open-source

linyiLYi/voice-assistant

A simple local voice assistant powered by Whisper and large language models.

1.3K
Archived
Python
LLM Frameworks
AI Voice & Speech
#voice-assistant#whisper#large-language-model

alexa-pi/AlexaPi

An open-source Alexa client for building voice-enabled applications.

1.3K
Archived
Python
Prompt Engineering
React
#authentication#voice-commands#AlexaPi

altic-dev/FluidVoice

macOS offline speech-to-text app using local ML—no cloud, fully private voice dictation

1.3K
Active
Swift
Desktop Model Runners
AI Voice & Speech
Swift
#offline-dictation#voice-to-text#local-inference

CSTR-Edinburgh/merlin

This open-source Python library is a toolkit for building speech synthesis and voice conversion systems using deep learning.

1.3K
Archived
Python
Speech Synthesis
Voice Conversion
#speech-synthesis#voice-conversion#text-to-speech

facebookresearch/svoice

A PyTorch implementation of a voice separation algorithm for mixed audio with multiple speakers.

1.3K
Archived
Python
AI Voice & Speech
PyTorch
#audio-processing#speech-separation#voice-separation

mailgun/talon

Talon is a Python library for building voice interfaces and voice-driven applications.

1.3K
Archived
Python
AI Voice & Speech
CLI Tools
Python
#voice-interface#speech-recognition#voice-driven

insoxin/API

Open-source API platform offering various services like Docker, IP, QR code, and more for developers.

1.3K
Stable
JavaScript
API Development
API Clients & Testing
Node
#api#docker#ip

Robitx/gp.nvim

A Neovim AI plugin that enables ChatGPT sessions, Instructable text/code operations, and Speech to Text functionality.

1.3K
Stable
Lua
LLM Wrappers & SDKs
AI Code Editors
Neovim
#neovim#chatgpt#speech-to-text
1...810...12

Stay in the loop

Get weekly updates on trending AI coding tools and projects.