Explore Projects

Discover 622 open source projects

Active filters (1):
Search: audioร—
Clear all

Showing 81-100 of 622 projects

OpenTalker/video-retalking

A Python library for audio-based lip synchronization in talking head video editing.

7.2K
Archived
Python
Computer Vision
Video Editing
#lip-synchronization#siggraph-asia-2022#talking-head-videos

turanszkij/WickedEngine

A modern 3D engine with advanced graphics features for game and visualization development.

6.9K
Active
C
Backend & APIs
CLI Tools
#3D#game-engine#graphics

muaz-khan/RecordRTC

A powerful WebRTC library for audio/video and screen recording, supporting multiple platforms and browsers.

6.9K
Archived
JavaScript
Frontend Frameworks
API Frameworks
React
#webrtc#recording#media-recorder

worldveil/dejavu

An audio fingerprinting and recognition library for Python that can be used to build music discovery and identification applications.

6.7K
Archived
Python
API Frameworks
Databases
#audio#fingerprinting#recognition

tenacityteam/tenacity-legacy

An open-source audio recording and editing application that focuses on privacy and security.

6.7K
Archived
C++
API Frameworks
CLI Tools
#audio#recording#privacy-friendly

bitgapp/eqMac

A macOS system-wide audio equalizer and volume mixer that allows developers to control audio on their computers.

6.5K
Stable
Swift
Audio Applications
Component Libraries (Swift)
Swift
#audio#equalizer#volume-mixer

supercollider/supercollider

An open-source audio programming environment for sound synthesis and algorithmic composition.

6.4K
Active
C++
Music
IDE Extensions
#audio#synthesis#music

mixxxdj/mixxx

Mixxx is an open-source, free DJ software that enables live music mixing and performance on various platforms.

6.4K
Active
C++
Music
Audio & Media
#music#dj#audio

Blaizzy/mlx-audio

A high-performance text-to-speech, speech-to-text, and speech-to-speech library for Apple Silicon devices.

6.1K
Active
Python
AI Voice & Speech
CLI Tools
Apple MLX
#apple-silicon#speech-recognition#speech-synthesis

souzatharsis/podcastfy

An open-source Python tool to transform multimodal content into captivating multilingual audio podcasts powered by GenAI.

6.1K
Stable
Python
LLM Wrappers & SDKs
Audio & Speech
Python
#genai#audio-generation#podcast

multimodal-art-projection/YuE

Open-source full-song music generation foundation model for developers building AI-powered audio applications.

6.1K
Experimental
Python
LLM Frameworks
Audio Generation
PyTorch
#music-generation#audio-generation#deep-learning

OpenBMB/VoxCPM

An open-source, tokenizer-free text-to-speech (TTS) model for context-aware speech generation and voice cloning.

6.0K
Active
Python
AI Voice & Speech
API Frameworks
PyTorch
#text-to-speech#voice-cloning#speech-synthesis

spotify/pedalboard

A Python library for building and experimenting with audio processing and machine learning models.

6.0K
Active
C++
Audio
API Frameworks
Python
#audio#audio-processing#audio-production

karlstav/cava

Cross-platform audio visualizer built with C, supporting multiple audio backends and platforms.

5.9K
Active
C
Animation & Motion
CLI Tools
#audio-visualization#cross-platform#cli

santinic/audiblez

A Python library that generates audiobooks from eBooks, enabling developers to create audio content experiences.

5.8K
Experimental
Python
File Storage
API Clients & Testing
Python
#audiobooks#epub#tts

NVIDIA/DALI

A highly optimized GPU-accelerated library for accelerating deep learning training and inference applications.

5.6K
Active
C++
GPU
Data Processing
PyTorch
#gpu#data-processing#deep-learning

xiangyuecn/Recorder

HTML5 JavaScript recording library that supports multiple audio formats and provides features like ASR and DTMF.

5.6K
Experimental
JavaScript
Recording & Streaming
Speech & Voice
#audio#recording#webrtc

cgzirim/seek-tune

An open-source implementation of the Shazam audio fingerprinting algorithm for song recognition in Go.

5.6K
Stable
Go
API Frameworks
Audio Processing
#audio-fingerprinting#shazam#song-recognition

ibab/tensorflow-wavenet

A TensorFlow implementation of DeepMind's WaveNet paper for generating high-quality speech audio.

5.4K
Archived
Python
ML Ops
Computer Vision
TensorFlow
#speech-generation#audio-processing#deep-learning

omriharel/deej

A Go and Arduino project that lets you build your own hardware mixer to control app volumes with real sliders.

5.4K
Archived
Go
Arduino & Embedded
#audio#volume-control#diy
1...46...32

Stay in the loop

Get weekly updates on trending AI coding tools and projects.