Explore Projects

Discover 622 open source projects

Active filters (1):
Search: audioร—
Clear all

Showing 181-200 of 622 projects

enhuiz/vall-e

An unofficial PyTorch implementation of the audio LM VALL-E, a text-to-speech AI model.

3.0K
Archived
Python
LLM Frameworks
AI Voice & Speech
PyTorch
#text-to-speech#tts#audio-lm

grame-cncm/faust

Faust is a functional programming language for signal processing and sound synthesis.

3.0K
Active
C++
Backend Frameworks
CLI Tools
#audio#c#c-plus-plus

ybayle/awesome-deep-learning-music

A curated list of articles related to deep learning applied to music and audio processing.

3.0K
Archived
TeX
Machine Learning
Tutorials & Courses
#deep-learning#music#audio-processing

chenyme/Chenyme-AAVT

This is a fully automated (audio) video translation project using Whisper for speech recognition and AI models for subtitling.

2.9K
Experimental
Python
AI Voice & Speech
API Frameworks
Python
#speech-recognition#video-translation#whisper

DISTRHO/Cardinal

A virtual modular synthesizer plugin for Eurorack and LV2-plugin formats.

2.9K
Active
C++
React
#plugin#synthesizer#eurorack

chaosprint/glicol

A graph-oriented live coding language and audio DSP library written in Rust for music and audio applications.

2.9K
Experimental
Rust
Backend & APIs
CLI Tools
Rust
#audio#dsp#live-coding

pluja/whishper

A web-based audio transcription and translation tool powered by Whisper AI models, built with Svelte.

2.9K
Stable
Svelte
AI Voice & Speech
Frontend Frameworks
Svelte
#speech-recognition#speech-to-text#transcription

vllm-project/vllm-omni

A Python framework for efficient model inference with omni-modality AI models.

2.9K
Active
Python
Inference
Multimodal
PyTorch
#audio-generation#diffusion#image-generation

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#multimodal

analogcode/Swift-Radio-Pro

A professional radio station app for iOS built with Swift, integrating with iTunes API, LastFM, and Spotify.

2.9K
Archived
Swift
Audio Player
Backend Frameworks
Swift
#audio-player#music-player#radio-station

davabase/whisper_real_time

Real-time audio transcription using the OpenAI Whisper AI model.

2.9K
Experimental
Python
LLM Wrappers & SDKs
AI Voice & Speech
Python
#audio-transcription#openai-whisper#real-time

tidalcycles/strudel

Web-based live coding environment for music patterns

2.9K
Experimental
Live Coding Environment
Next.js
#algorave#algorithmic-patterns#javascript

elevenlabs/elevenlabs-python

Official Python SDK for ElevenLabs text-to-speech API with voice synthesis & audio generation.

2.9K
Active
Python
AI SDKs & Wrappers
AI Voice & Speech
Python
#text-to-speech#voice-synthesis#elevenlabs-api

murtaza-nasir/speakr

Speakr is a personal, self-hosted web application for transcribing audio recordings

2.9K
Active
Python
React
#transcription#audio#self-hosted

MeiGen-AI/MultiTalk

Multimodal conversational video generation powered by AI, enabling new vibe-coder collaboration experiences.

2.8K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#ai-powered#multimodal#conversational

readbeyond/aeneas

A Python/C library and toolkit for automatically synchronizing audio and text (forced alignment).

2.8K
Archived
Python
Audio & Speech
CLI Tools
Python
#audio#text-to-speech#alignment

drewnoakes/metadata-extractor

A Java library for extracting metadata from various media file formats, including images, videos, and audio.

2.8K
Experimental
Java
Libraries & Utilities
Backend Frameworks
#metadata#exif#iptc

douban/DOUAudioStreamer

A Core Audio based streaming audio player for iOS and macOS developers.

2.8K
Archived
Objective-C
API Frameworks
iOS
#streaming#audio#core-audio

Engine-Simulator/engine-sim-community-edition

A combustion engine simulation game that generates realistic audio for developers interested in realistic physics simulations.

2.8K
Stable
API Frameworks
Computer Vision
#physics-simulation#audio-generation#realistic-modeling

AIDC-AI/Pixelle-Video

A fully automated AI-powered short video engine for generating videos from text, images, and audio.

2.8K
Active
Python
AI Image & Video
AI Code Generation
Python
#aigc#video-generation#image-generation
1...911...32

Stay in the loop

Get weekly updates on trending AI coding tools and projects.