Showing 181-200 of 622 projects
An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.
Faust is a functional programming language for signal processing and sound synthesis.
A curated list of articles related to deep learning applied to music and audio processing.
This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.
A virtual modular synthesizer plugin for Eurorack and LV2-plugin formats.
A graph-oriented live coding language and audio DSP library written in Rust for music and audio applications.
A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.
A Python framework for efficient model inference with omni-modality AI models.
A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.
A professional radio station app for iOS built with Swift, integrating with iTunes API, LastFM, and Spotify.
Real-time audio transcription using the OpenAI Whisper AI model.
Web-based live coding environment for music patterns
Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.
Speakr is a personal, self-hosted web application for transcribing audio recordings
Multimodal conversational video generation powered by AI, enabling new vibe-coder collaboration experiences.
A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).
A Java library for extracting metadata from various media file formats, including images, videos, and audio.
A Core Audio based streaming audio player for iOS and macOS developers.
A combustion engine simulation game that generates realistic audio for developers interested in realistic physics simulations.
A fully automated AI-powered short video engine for generating videos from text, images, and audio.
Get weekly updates on trending AI coding tools and projects.