Explore Projects

Discover 622 open source projects

Active filters (1):

Search: audio


Showing 181-200 of 622 projects

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.

3.0K

Archived

Python

LLM Frameworks

AI Voice & Speech

PyTorch

#text-to-speech #tts #audio-lm

grame-cncm/faust

Faust is a functional programming language for signal processing and sound synthesis.

3.0K

Active

C++

Backend Frameworks

CLI Tools

#audio #c #c-plus-plus

ybayle/awesome-deep-learning-music

A curated list of articles related to deep learning applied to music and audio processing.

3.0K

Archived

TeX

Machine Learning

Tutorials & Courses

#deep-learning #music #audio-processing

chenyme/Chenyme-AAVT

A fully automated audio and video translation project that uses Whisper for speech recognition and AI models for subtitling.

2.9K

Experimental

Python

AI Voice & Speech

API Frameworks

Python

#speech-recognition #video-translation #whisper

DISTRHO/Cardinal

A virtual Eurorack-style modular synthesizer plugin, available in LV2 and other plugin formats.

2.9K

Active

C++

React

#plugin #synthesizer #eurorack

chaosprint/glicol

A graph-oriented live coding language and audio DSP library written in Rust for music and audio applications.

2.9K

Experimental

Rust

Backend & APIs

CLI Tools

Rust

#audio #dsp #live-coding

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K

Stable

Svelte

AI Voice & Speech

Frontend Frameworks

Svelte

#speech-recognition #speech-to-text #transcription

vllm-project/vllm-omni

A Python framework for efficient inference with omni-modality AI models.

2.9K

Active

Python

Inference

Multimodal

PyTorch

#audio-generation #diffusion #image-generation

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K

Experimental

Python

LLM Frameworks

Computer Vision

PyTorch

#chatgpt #gpt-4 #multimodal

analogcode/Swift-Radio-Pro

A professional radio station app for iOS, built with Swift and integrating with the iTunes API, LastFM, and Spotify.

2.9K

Archived

Swift

Audio Player

Backend Frameworks

Swift

#audio-player #music-player #radio-station

davabase/whisper_real_time

Real-time audio transcription using the OpenAI Whisper AI model.

2.9K

Experimental

Python

LLM Wrappers & SDKs

AI Voice & Speech

Python

#audio-transcription #openai-whisper #real-time

tidalcycles/strudel

A web-based live coding environment for music patterns.

2.9K

Experimental

Live Coding Environment

Next.js

#algorave #algorithmic-patterns #javascript

elevenlabs/elevenlabs-python

Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.

2.9K

Active

Python

AI SDKs & Wrappers

AI Voice & Speech

Python

#text-to-speech #voice-synthesis #elevenlabs-api

murtaza-nasir/speakr

Speakr is a personal, self-hosted web application for transcribing audio recordings.

2.9K

Active

Python

React

#transcription #audio #self-hosted

MeiGen-AI/MultiTalk

AI-powered, audio-driven generation of multi-person conversational video.

2.8K

Stable

Python

LLM Frameworks

Agents & Orchestration

Python

#ai-powered #multimodal #conversational

readbeyond/aeneas

A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).

2.8K

Archived

Python

Audio & Speech

CLI Tools

Python

#audio #text-to-speech #alignment

drewnoakes/metadata-extractor

A Java library for extracting metadata from various media file formats, including images, videos, and audio.

2.8K

Experimental

Java

Libraries & Utilities

Backend Frameworks

#metadata #exif #iptc

douban/DOUAudioStreamer

A Core Audio-based streaming audio player for iOS and macOS.

2.8K

Archived

Objective-C

API Frameworks

iOS

#streaming #audio #core-audio

Engine-Simulator/engine-sim-community-edition

A combustion engine simulator that generates realistic engine audio, aimed at developers interested in physics simulation.

2.8K

Stable

API Frameworks

Computer Vision

#physics-simulation #audio-generation #realistic-modeling

AIDC-AI/Pixelle-Video

A fully automated AI-powered short video engine for generating videos from text, images, and audio.

2.8K

Active

Python

AI Image & Video

AI Code Generation

Python

#aigc #video-generation #image-generation

