Explore Projects

Discover 622 open source projects

Active filters (1):

Search: audio×

Showing 61-80 of 622 projects

pyannote/pyannote-audio

A neural network library for speaker diarization, including speech activity detection, speaker change detection, and speaker embedding.

9.3K

Active

Jupyter Notebook

Speech Processing

API Frameworks

PyTorch

#speech-recognition#speaker-diarization#audio-processing

ggerganov/kbd-audio

A C++ library for acoustic keyboard eavesdropping using microphone audio capture.

9.0K

Archived

C++

API Frameworks

CLI Tools

#acoustic#eavesdrop#microphone-audio-capture

Uberi/speech_recognition

Python speech recognition library supporting multiple engines and APIs, both online and offline.

9.0K

Active

Python

AI Voice & Speech

Python

#speech-recognition#speech-to-text#audio

wwmm/easyeffects

A collection of various audio effects plugins for PipeWire, a sound server for Linux.

8.9K

Active

HTML

Audio Effects

CLI Tools

#audio-effects#pipewire#pulseaudio

jianchang512/clone-voice

A sound cloning tool that lets you use your voice or any sound to record audio, with a web interface.

8.9K

Stable

Python

AI Voice & Speech

Frontend Frameworks

React

#clonevoice#speech-analysis#tts

fudan-generative-vision/hallo

Generates hierarchical audio-driven visual synthesis for portrait image animation

8.6K

Archived

Python

React

#animation#face-animation#image-animation

aandrew-me/ytDownloader

Desktop app for downloading videos and audio from hundreds of sites with support for various platforms.

8.3K

Stable

JavaScript

General Utilities

Backend Frameworks

Electron

#downloader#video-downloader#audio-downloader

mediaelement/mediaelement

An HTML5 media player library with support for various video and audio formats, as well as streaming protocols.

8.3K

Stable

JavaScript

Component Libraries (React)

Frontend Frameworks

React

#html5#video#audio

fluent-ffmpeg/node-fluent-ffmpeg

A fluent API for the FFMPEG media processing library, enabling developers to work with video and audio files.

8.3K

Experimental

JavaScript

API Frameworks

Node.js

#ffmpeg#video-processing#audio-processing

librosa/librosa

A Python library for audio and music analysis, useful for developers working with audio-related applications.

8.2K

Active

Python

Backend Frameworks

Caching

Python

#audio#dsp#music

juce-framework/JUCE

An open-source C++ framework for building desktop and mobile applications, including audio plug-ins.

8.1K

Active

C++

React

#audio#c-plus-plus#plugin

openai/jukebox

A generative model for creating music, implemented in Python with PyTorch.

8.0K

Archived

Python

LLM Frameworks

Audio

PyTorch

#generative-model#music-generation#audio-synthesis

boson-ai/higgs-audio

Text-audio foundation model from Boson AI for vibe coders building AI-powered applications.

8.0K

Active

Python

LLM Frameworks

AI SDKs & Wrappers

Python

#text-to-speech#audio-generation#foundation-model

deniscerri/ytdlnis

A full-featured audio/video downloader for Android using the yt-dlp library.

7.9K

Active

Kotlin

Android

Audio

Kotlin

#android#audio#downloader

mumble-voip/mumble

Mumble is an open-source, low-latency, high-quality voice chat software for gaming and communication.

7.8K

Active

C++

Realtime

Full-Stack Frameworks

CMake

#voip#voice-chat#open-source

FunAudioLLM/SenseVoice

A multilingual voice understanding model for AI-powered audio analysis and transcription.

7.6K

Stable

Python

AI Voice & Speech

API Frameworks

PyTorch

#speech-recognition#speech-emotion-recognition#audio-event-classification

HumanAIGC/EMO

This repository contains a diffusion model for generating expressive portrait videos from audio.

7.6K

Archived

Computer Vision

AI Image & Video

#computer-vision#video-generation#audio-to-video

smacke/ffsubsync

Automagically synchronize subtitles with video using audio alignment and speech detection.

7.6K

Stable

Python

AI Audio & Speech

API Frameworks

#audio-alignment#speech-detection#subtitle-synchronization

snapcast/snapcast

Synchronous multiroom audio player for building audio streaming applications

7.5K

Stable

C++

Real-time

Audio Streaming

#audio#audio-streaming#multiroom-audio

clappr/clappr

An extensible, plugin-oriented, HTML5-first media player for the web

7.4K

Active

JavaScript

Component Libraries (React)

Frontend Frameworks

React

#html5-video#streaming#video-player

1...35...32

Stay in the loop

Get weekly updates on trending AI coding tools and projects.