Explore Projects

Discover 54 open source projects

Active filters (1):
Search: speakerร—
Clear all

Showing 41-54 of 54 projects

Marak/say.js

A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.

1.5K
Archived
JavaScript
API Frameworks
#tts#text-to-speech#audio

fossasia/eventyay-talk

A Python-based event management platform with a focus on speakers and talks.

1.5K
Active
Python
API Frameworks
Backend Frameworks
Flask
#event-management#speaker-management#talks

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K
Stable
Python
AI Voice & Speech
ComfyUI Custom Nodes
React
#text-to-speech#voice-cloning#ComfyUI

music-assistant/server

Music Assistant is an open-source media library manager that connects to streaming services and smart speakers.

1.4K
Active
Python
API Frameworks
Backend Frameworks
Python
#streaming#media-library#open-source

FireRedTeam/FireRedTTS2

FireRedTTS2 is a long-form streaming TTS system for generating multi-speaker dialogue in Python.

1.3K
Stable
Python
AI Voice & Speech
API Frameworks
Python
#tts#streaming#multi-speaker

zenorocha/voice-elements

A web component wrapper for the Web Speech API, enabling voice recognition and speech synthesis.

1.3K
Archived
HTML
AI Voice & Speech
Polymer
#voice-recognition#speech-synthesis#web-components

facebookresearch/svoice

A PyTorch implementation of a voice separation algorithm for mixed audio with multiple speakers.

1.3K
Archived
Python
AI Voice & Speech
PyTorch
#audio-processing#speech-separation#voice-separation

yeyupiaoling/VoiceprintRecognition-Pytorch

This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.

1.2K
Stable
Python
AI Voice & Speech
API Frameworks
PyTorch
#speaker-recognition#voice-recognition#arcface

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K
Archived
Python
Audio & Speech
Signal Processing
PyTorch
#audio-processing#speech-recognition#signal-processing

wenet-e2e/wespeaker

A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.

1.2K
Active
Python
Speech & Voice
API Frameworks
PyTorch
#speech-recognition#speaker-verification#speaker-diarization

OpenMOSS/MOSS-TTSD

An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.

1.2K
Stable
Python
AI Voice & Speech
API Frameworks
#speech-dialogue-generation#multi-speaker-voice-cloning#long-form-speech-generation

clovaai/voxceleb_trainer

A Python library for training speaker recognition models using the VoxCeleb dataset.

1.2K
Archived
Python
Computer Vision
API Frameworks
Python
#speaker-recognition#speaker-verification#metric-learning

JuergenFleiss/aTrain

A Python GUI tool for offline transcription of speech recordings, including speaker diarization, using state-of-the-art machine learning models.

1.1K
Active
Python
AI Voice & Speech
CLI Tools
#speech-recognition#transcription#diarization

Edresson/YourTTS

A zero-shot multi-speaker text-to-speech (TTS) and voice conversion library for developers.

1.1K
Archived
Jupyter Notebook
AI Voice & Speech
Backend Frameworks
Jupyter Notebook
#speech-synthesis#tts#voice-conversion

Stay in the loop

Get weekly updates on trending AI coding tools and projects.