Showing 41-60 of 622 projects
NGINX-based Media Streaming Server for real-time video and audio streaming applications.
SadTalker is a CVPR 2023 project that enables stylized audio-driven single image talking face animation.
This is a sample audio app for Android, not focused on AI coding tools.
Open-source speech toolkit with state-of-the-art ASR, TTS, translation, and audio processing capabilities.
Self-hosted audiobook and podcast server for developers who want to manage their personal audio collections.
Sonic Pi is a live coding environment for creating music and sound using Ruby.
SFML is a simple and fast multimedia library for building cross-platform games and multimedia applications in C++.
AudioKit is an audio synthesis, processing, and analysis platform for iOS, macOS, and tvOS
A PyTorch-based toolkit for speech processing, including ASR, speaker recognition, and speech enhancement.
A Windows volume control application built with C# and WPF.
A C# library for capturing screen, audio, cursor, mouse clicks and keystrokes.
openFrameworks is a cross-platform toolkit for creative coding in C++, used by developers building multimedia and interactive applications.
AudioGPT is a powerful tool for understanding and generating speech, music, sound, and talking heads using AI.
A React component for playing a variety of media URLs, including YouTube, SoundCloud, Twitch, and more.
An audio waveform player library built with TypeScript for web applications.
An open-source library for hybrid spectrogram and waveform source separation, useful for audio processing tasks.
Moshi is an open-source speech-to-text foundation model and dialogue framework for building AI-powered voice apps.
A Python library for easy high-level audio manipulation and processing.
Amphion is a toolkit for Audio, Music, and Speech Generation to support reproducible research.
A realistic combustion engine simulator that generates authentic audio for developers.
Get weekly updates on trending AI coding tools and projects.