Explore Projects

Discover 622 open source projects

Active filters (1):

Search: audio

Showing 421-440 of 622 projects

teamspeak/teamspeak6-server

TeamSpeak 6 Server beta with low-latency audio, Docker support, and Linux deployment.

1.4K

Active

Containerization

System Utilities

Docker

#teamspeak6 #low-latency-audio #docker-deployment

YuanGongND/ast

Audio Spectrogram Transformer (AST) for audio classification and representation learning tasks.

1.4K

Archived

Jupyter Notebook

Computer Vision

Speech & Audio

PyTorch

#audio-classification #speech-recognition #deep-learning

mikeroyal/PipeWire-Guide

A guide to PipeWire, a multimedia server that provides a professional audio/video processing workflow on Linux.

1.4K

Experimental

Shell

CLI Tools

Linux Distros

#audio #video #multimedia

numz/sd-wav2lip-uhq

A Python extension for the Stable Diffusion WebUI that enables high-quality lip-sync animation for talking face generation.

1.4K

Archived

Python

AI Image & Video

Animation & Motion

Stable Diffusion WebUI

#audio-driven-talking-face #deep-fake #lip-sync

mdn/webaudio-examples

A collection of code examples for the Web Audio API to help developers build audio-based web apps.

1.4K

Stable

HTML

Frontend Frameworks

API Frameworks

#audio #webaudio #examples

superpoweredSDK/Low-Latency-Android-iOS-Linux-Windows-tvOS-macOS-Interactive-Audio-Platform

High-performance cross-platform audio, networking and cryptography SDKs for Android, iOS, macOS, tvOS, Linux, Windows and modern web browsers.

1.4K

Active

C++

React

#authentication #cryptography #cross-platform

belangeo/pyo

A Python DSP module for audio processing, sound synthesis, and music creation.

1.4K

Stable

Python

API Frameworks

Signal Processing

#audio #dsp #music

birdnet-team/BirdNET-Analyzer

A Python library for processing and analyzing scientific audio data, particularly for bird song detection and recognition.

1.4K

Stable

Python

Computer Vision

Caching

#bioacoustics #bird-song #deep-learning

juhovh/shairplay

An open-source server implementation of Apple's AirPlay and RAOP protocols for streaming audio.

1.4K

Archived

API Frameworks

General Utilities

#airplay #raop #streaming

MiteshPuthran/Speech-Emotion-Analyzer

A neural network model for detecting emotions from speech audio using Python and deep learning.

1.4K

Archived

Jupyter Notebook

Speech Recognition

Machine Learning

#speech-recognition #emotion-detection #neural-network

midarrlabs/midarr-server

Midarr is a minimal, lightweight media server built with Elixir, suitable for self-hosting video and audio content.

1.4K

Stable

Elixir

API Frameworks

Backend Frameworks

#media-server #self-hosted #video-streaming

eibols/ffmpeg_batch

A C# batch converter for audio and video files, built on FFmpeg.

1.4K

Active

API Frameworks

Video Encoding

#ffmpeg #batch-processing #audio-conversion

sonysuqin/WasmVideoPlayer

A WebAssembly-based video player that supports a wide range of codecs and streaming protocols, including H.265 and WebSocket.

1.4K

Archived

JavaScript

Animation & Motion

Frontend Frameworks

React

#video-player #webassembly #streaming

bytedance/SALMONN

SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.

1.4K

Stable

LLM Frameworks

Speech Recognition

#audio-processing #speech-recognition #video-understanding

SociallyIneptWeeb/AICoverGen

A WebUI tool to create song covers using RVC v2 AI voices from audio files or YouTube videos.

1.4K

Experimental

Python

AI Voice & Speech

WebUI

React

#ai-audio #song-covers #webui

coqui-ai/open-speech-corpora

A collection of open-source speech corpora for building speech recognition, synthesis, and other audio applications.

1.4K

Archived

AI Voice & Speech

Databases

#speech-recognition #speech-synthesis #speech-processing

chrisdonahue/wavegan

WaveGAN is a Python library that enables developers to synthesize raw audio using generative adversarial networks.

1.4K

Archived

Python

Computer Vision

Generative Adversarial Networks

#audio-synthesis #generative-adversarial-networks #machine-learning

celluloid-player/celluloid

A simple GTK+ frontend for the mpv media player with support for various audio and video formats.

1.4K

Active

Component Libraries (GTK)

Video Player

#gtk #mpv #media-player

deweller/switchaudio-osx

A command-line tool for switching the audio source on macOS.

1.4K

Archived

CLI Tools

General Utilities

#audio #macOS #terminal

bytedance/music_source_separation

A Python library for music source separation, a task in audio signal processing.

1.4K

Archived

Python

Computer Vision

Caching

#audio-processing #signal-processing #music-separation
