Showing 21-39 of 39 projects
A C# library to convert Audible audiobook files (aax) to more widely supported formats like MP3 and M4A/M4B.
A multimedia framework for building audio and video processing applications in C.
An implementation of the JamesDSP audio processing engine for non-rooted Android devices.
A guide to PipeWire, a multimedia server that provides a professional audio/video processing workflow on Linux.
SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.
A C++ framework for building digital audio workstations, audio plugins, and music production tools.
A C++ library for audio digital signal processing, including effects, pitch detection, and synthesis.
Synchronized Translation for Videos: Automatic dubbing and subtitling for video content.
A C++ guitar plugin that uses neural networks to emulate a tube amplifier for audio processing and music creation.
A fundamental toolkit for music, song, and audio generation using PyTorch.
FFME is an advanced WPF MediaElement library based on FFmpeg, providing enhanced video and audio playback capabilities.
A curated list of audio DSP and plugin development resources for developers.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
SincNet is a neural architecture for efficiently processing raw audio samples for speech and audio processing tasks.
A C library for generating audio fingerprints used by the AcoustID music identification service.
A powerful Digital Audio Workstation (DAW) built with Python, VST instruments/effects, and AI/ML tools like FAUST, JAX, and JUCE.
Open-source audio annotation tool for machine learning and speech processing datasets.
A PyTorch-based audio processing library for spectrograms, CQT, and neural network-based preprocessing.
Open source audio fingerprinting library in C# for building acoustic recognition applications.
Get weekly updates on trending AI coding tools and projects.