Showing 141-160 of 188 projects
A curated collection of resources on applying Transformers to medical imaging tasks like segmentation, classification, and synthesis.
A JavaScript library for building voice-controlled web applications using speech recognition and synthesis.
A comprehensive survey on knowledge distillation techniques for large language models.
A powerful library for photographic image synthesis using cascaded refinement networks.
A high-performance image synthesis model with improved distribution matching distillation.
Official implementation of a generative motion modeling framework for 3D human motions.
A Python library for planning with diffusion models to enable flexible behavior synthesis.
The official ElevenLabs MCP server, a Python-based server for the ElevenLabs AI-powered voice synthesis platform.
An all-in-one model for offline and simultaneous speech recognition, translation, and synthesis.
Mozzi is a sound synthesis library for Arduino, allowing developers to create rich audio experiences.
Efficient Region-Aware Neural Radiance Fields for high-fidelity talking portrait synthesis
A diffusion-based video generation model for trajectory-oriented video synthesis.
Code for a paper on feed-forward synthesis of textures and stylized images using neural networks.
Teensy Audio Library is a powerful C++ library for audio processing and synthesis on Teensy boards.
libpd is an embeddable audio synthesis library that allows developers to integrate Pure Data into their applications.
Open-source CAD flow for FPGA research, enabling Verilog-to-routing design and verification
LPCNet is an efficient neural speech synthesis library for developers building voice-based applications.
An efficient 3D Gaussian splatting method for novel view synthesis from sparse multi-view images.
A powerful GAN-based text-to-image synthesis model for fast large-scale image generation.
An open-source speech dialogue generation model that enables expressive dialogue speech synthesis in Chinese and English.
Get weekly updates on trending AI coding tools and projects.