Showing 421-440 of 622 projects
TeamSpeak 6 Server beta with low-latency audio, Docker support, and Linux deployment.
Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.
A guide to PipeWire, a multimedia server that provides a professional audio/video processing workflow on Linux.
A Python extension for the Stable Diffusion WebUI that enables high-quality lip-sync animation for talking face generation.
A collection of code examples for the Web Audio API to help developers build audio-based web apps.
High-performance cross-platform audio, networking and cryptography SDKs for Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.
A Python DSP module for audio processing, sound synthesis, and music creation.
A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.
An open-source server implementation of Apple's AirPlay and RAOP protocols for streaming audio.
A neural network model for detecting different emotions from audio speeches using Python and deep learning.
Midarr is a minimal, lightweight media server built with Elixir, suitable for self-hosting video and audio content.
A C# library that provides a batch converter for audio and video files using the powerful FFmpeg library.
A WebAssembly-based video player that supports a wide range of codecs and streaming protocols, including h265 and websocket.
SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.
A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.
A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.
WaveGAN is a Python library that enables developers to synthesize raw audio using generative adversarial networks.
A simple GTK+ frontend for the mpv media player with support for various audio and video formats.
A command-line tool to change the audio source on macOS from the terminal.
A Python library for music source separation, a task in audio signal processing.
Get weekly updates on trending AI coding tools and projects.