A scalable generative AI framework for researchers and developers.
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.
Frontier CoreML audio models for iOS and macOS apps with text-to-speech, speech-to-text, voice activity detection, and speaker diarization.
A library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, for supervised speaker diarization.
This project provides advanced voiceprint recognition models and data preprocessing methods using PyTorch.
SincNet is a neural architecture that efficiently processes raw audio samples for speech and audio tasks.
A research and production-oriented toolkit for speaker verification, recognition, and diarization using AI and ML techniques.
A Python library for training speaker recognition models using the VoxCeleb dataset.