Explore Projects

Discover 54 open source projects

Active filters (1):

Search: speaker


Showing 41-54 of 54 projects

Marak/say.js

A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.

1.5K

Archived

JavaScript

API Frameworks

#tts #text-to-speech #audio

fossasia/eventyay-talk

A Python-based event management platform with a focus on speakers and talks.

1.5K

Active

Python

API Frameworks

Backend Frameworks

Django

#event-management #speaker-management #talks

Enemyx-net/VibeVoice-ComfyUI

ComfyUI integration for Microsoft's VibeVoice text-to-speech model

1.4K

Stable

Python

AI Voice & Speech

ComfyUI Custom Nodes

React

#text-to-speech #voice-cloning #ComfyUI

music-assistant/server

Music Assistant is an open-source media library manager that connects to streaming services and smart speakers.

1.4K

Active

Python

API Frameworks

Backend Frameworks

Python

#streaming #media-library #open-source

FireRedTeam/FireRedTTS2

FireRedTTS2 is a long-form streaming TTS system for generating multi-speaker dialogue in Python.

1.3K

Stable

Python

AI Voice & Speech

API Frameworks

Python

#tts #streaming #multi-speaker

zenorocha/voice-elements

A web component wrapper for the Web Speech API, enabling voice recognition and speech synthesis.

1.3K

Archived

HTML

AI Voice & Speech

Polymer

#voice-recognition #speech-synthesis #web-components

facebookresearch/svoice

A PyTorch implementation of a voice separation algorithm for mixed audio with multiple speakers.

1.3K

Archived

Python

AI Voice & Speech

PyTorch

#audio-processing #speech-separation #voice-separation

yeyupiaoling/VoiceprintRecognition-Pytorch

This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.

1.2K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speaker-recognition #voice-recognition #arcface

mravanelli/SincNet

SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.

1.2K

Archived

Python

Audio & Speech

Signal Processing

PyTorch

#audio-processing #speech-recognition #signal-processing

wenet-e2e/wespeaker

A research- and production-oriented toolkit for speaker verification, recognition, and diarization.

1.2K

Active

Python

Speech & Voice

API Frameworks

PyTorch

#speech-recognition #speaker-verification #speaker-diarization

OpenMOSS/MOSS-TTSD

An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.

1.2K

Stable

Python

AI Voice & Speech

API Frameworks

#speech-dialogue-generation #multi-speaker-voice-cloning #long-form-speech-generation

clovaai/voxceleb_trainer

A Python library for training speaker recognition models using the VoxCeleb dataset.

1.2K

Archived

Python

Computer Vision

API Frameworks

Python

#speaker-recognition #speaker-verification #metric-learning

JuergenFleiss/aTrain

A Python GUI tool for offline transcription of speech recordings, including speaker diarization, using state-of-the-art machine learning models.

1.1K

Active

Python

AI Voice & Speech

CLI Tools

#speech-recognition #transcription #diarization

Edresson/YourTTS

A zero-shot multi-speaker text-to-speech (TTS) and voice conversion library for developers.

1.1K

Archived

Jupyter Notebook

AI Voice & Speech

Backend Frameworks

Jupyter Notebook

#speech-synthesis #tts #voice-conversion
