A PyTorch implementation of a BERT-style pretraining method for convolutional networks, enabling more efficient self-supervised learning.
Zero-shot image restoration using a denoising diffusion model that can remove various types of noise and artifacts.
A PyTorch implementation of the Capsule Graph Neural Network (CapsGNN) for graph classification tasks.
A state-of-the-art discrete acoustic codec model for audio language modeling with 40/75 tokens per second.
Official PyTorch implementation of BigVGAN, a neural vocoder for generating high-quality audio, music, and speech.
A Python library that speeds up inference for large language models by up to 10x with dynamic sparse attention.
Open-source code for a few-shot classification paper, useful for AI/ML researchers and developers.
An open-source library for co-speech gesture video reenactment with AI models.
A library for studying the robustness of computer vision models to various corruptions and perturbations.
ToRA is a series of Tool-integrated Reasoning LLM Agents for solving mathematical reasoning problems.
Large-scale 3D scene reconstruction and novel view synthesis using Gaussian representations.
One-shot realistic 3D talking portrait synthesis, implemented in Python.
A multimodal large language model series for Chinese and English AI-powered coding and painting tools.
TimesNet is an open-source library for temporal 2D-variation modeling and general time series analysis.
A Python library for generating multi-view-consistent images from a single-view image using diffusion models.