Explore Projects

Discover 622 open source projects

Active filters (1):
Search: audioร—
Clear all

Showing 421-440 of 622 projects

teamspeak/teamspeak6-server

TeamSpeak 6 Server beta with low-latency audio, Docker support, and Linux deployment.

1.4K
Active
Containerization
System Utilities
Docker
#teamspeak6#low-latency-audio#docker-deployment

YuanGongND/ast

Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.

1.4K
Archived
Jupyter Notebook
Computer Vision
Speech & Audio
PyTorch
#audio-classification#speech-recognition#deep-learning

mikeroyal/PipeWire-Guide

A guide to PipeWire, a multimedia server that provides a professional audio/video processing workflow on Linux.

1.4K
Experimental
Shell
CLI Tools
Linux Distros
#audio#video#multimedia

numz/sd-wav2lip-uhq

A Python extension for the Stable Diffusion WebUI that enables high-quality lip-sync animation for talking face generation.

1.4K
Archived
Python
AI Image & Video
Animation & Motion
Stable Diffusion WebUI
#audio-driven-talking-face#deep-fake#lip-sync

mdn/webaudio-examples

A collection of code examples for the Web Audio API to help developers build audio-based web apps.

1.4K
Stable
HTML
Frontend Frameworks
API Frameworks
#audio#webaudio#examples

superpoweredSDK/Low-Latency-Android-iOS-Linux-Windows-tvOS-macOS-Interactive-Audio-Platform

High-performance cross-platform audio, networking and cryptography SDKs for Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.

1.4K
Active
C++
React
#authentication#cryptography#cross-platform

belangeo/pyo

A Python DSP module for audio processing, sound synthesis, and music creation.

1.4K
Stable
Python
API Frameworks
Signal Processing
#audio#dsp#music

birdnet-team/BirdNET-Analyzer

A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.

1.4K
Stable
Python
Computer Vision
Caching
#bioacoustics#bird-song#deep-learning

juhovh/shairplay

An open-source server implementation of Apple's AirPlay and RAOP protocols for streaming audio.

1.4K
Archived
C
API Frameworks
General Utilities
#airplay#raop#streaming

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting different emotions from audio speeches using Python and deep learning.

1.4K
Archived
Jupyter Notebook
Speech Recognition
Machine Learning
Jupyter Notebook
#speech-recognition#emotion-detection#neural-network

midarrlabs/midarr-server

Midarr is a minimal, lightweight media server built with Elixir, suitable for self-hosting video and audio content.

1.4K
Stable
Elixir
API Frameworks
Backend Frameworks
#media-server#self-hosted#video-streaming

eibols/ffmpeg_batch

A C# library that provides a batch converter for audio and video files using the powerful FFmpeg library.

1.4K
Active
C#
API Frameworks
Video Encoding
#ffmpeg#batch-processing#audio-conversion

sonysuqin/WasmVideoPlayer

A WebAssembly-based video player that supports a wide range of codecs and streaming protocols, including h265 and websocket.

1.4K
Archived
JavaScript
Animation & Motion
Frontend Frameworks
React
#video-player#webassembly#streaming

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K
Stable
LLM Frameworks
Speech Recognition
#audio-processing#speech-recognition#video-understanding

SociallyIneptWeeb/AICoverGen

A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.

1.4K
Experimental
Python
AI Voice & Speech
WebUI
React
#ai-audio#song-covers#webui

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K
Archived
AI Voice & Speech
Databases
#speech-recognition#speech-synthesis#speech-processing

chrisdonahue/wavegan

WaveGAN is a Python library that enables developers to synthesize raw audio using generative adversarial networks.

1.4K
Archived
Python
Computer Vision
Generative Adversarial Networks
Python
#audio-synthesis#generative-adversarial-networks#machine-learning

celluloid-player/celluloid

A simple GTK+ frontend for the mpv media player with support for various audio and video formats.

1.4K
Active
C
Component Libraries (GTK)
Video Player
#gtk#mpv#media-player

deweller/switchaudio-osx

A command-line tool to change the audio source on macOS from the terminal.

1.4K
Archived
C
CLI Tools
General Utilities
#audio#macOS#terminal

bytedance/music_source_separation

A Python library for music source separation, a task in audio signal processing.

1.4K
Archived
Python
Computer Vision
Caching
#audio-processing#signal-processing#music-separation
1...2123...32

Stay in the loop

Get weekly updates on trending AI coding tools and projects.