Framework for state-of-the-art ML models in text, vision, audio, and multimodal tasks.
FFmpeg processes multimedia content like audio and video.
Self-hosted, open-source AI alternative to OpenAI with local LLM inference, no GPU required
Open source HTML5 video player with support for HLS, DASH, and cross-platform compatibility
Text-to-audio model for generating realistic speech and sounds
Instant voice cloning model with tone color cloning and multi-lingual support
Voice conversion framework with web UI for training and real-time voice models
Command-line media player with support for various formats and codecs
Cross-platform ML framework for real-time media processing
State-of-the-art diffusion models for image, audio, and video generation in PyTorch.
Single-file public domain C/C++ libraries for image processing, audio decoding, and utilities
File upload widget for jQuery with drag-and-drop, progress bars, and cross-domain support
SRS is a high-efficiency real-time media server supporting multiple streaming protocols and codecs.
Source separation library for audio processing with pretrained models
Singing voice conversion framework using AI
Browser fingerprinting library for tracking users
JavaScript audio library for the modern web, built on the Web Audio API with a fallback to HTML5 Audio
Image annotation tool for computer vision projects
GUI for vocal removal using deep learning models
AudioCraft is a PyTorch library for audio generation with deep learning models like MusicGen and AudioGen.