A simple text-to-speech library for Node.js that allows developers to add voice output to their applications.
A Python-based event management platform with a focus on speakers and talks.
A ComfyUI integration for Microsoft's VibeVoice text-to-speech model.
Music Assistant is an open-source media library manager that connects to streaming services and smart speakers.
FireRedTTS2 is a long-form streaming TTS system, written in Python, for generating multi-speaker dialogue.
A web component wrapper for the Web Speech API, enabling voice recognition and speech synthesis.
A PyTorch implementation of a voice separation algorithm for mixed audio with multiple speakers.
This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.
SincNet is a neural architecture that efficiently processes raw audio samples for speech and audio tasks.
A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.
An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.
A Python library for training speaker recognition models using the VoxCeleb dataset.
A Python GUI tool for offline transcription of speech recordings, including speaker diarization, using state-of-the-art machine learning models.
A zero-shot multi-speaker text-to-speech (TTS) and voice conversion library for developers.