Showing 1-20 of 39 projects
This repository is an archived collection of papers and code related to computer vision and machine learning.
Official implementation of the paper 'Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection'.
Donut is an OCR-free Document Understanding Transformer and Synthetic Document Generator for computer vision and document AI tasks.
A high-performance multi-object tracking library for real-time computer vision applications.
An open-source, camera-only framework for autonomous driving perception tasks like 3D object detection and semantic map segmentation.
A Python library for controllable and consistent human image animation with 3D parametric guidance
An open-source deep learning-based tool for automatic colorization of grayscale images.
A Python library for high-quality frame interpolation of large motion videos using neural networks.
A PyTorch implementation of a fast and accurate 3D face alignment model for computer vision applications.
An open-domain image animation tool that leverages video diffusion priors for dynamic image generation.
A diverse and well-annotated dataset for license plate detection and recognition
Contrastive unpaired image-to-image translation library using PyTorch, with faster and lighter training than CycleGAN.
A high-performance YOLO-based face detection model for computer vision applications.
GANimation is a PyTorch library for generating facial animations from a single image using generative adversarial networks.
An ECCV 2022 paper on long-term video object segmentation using an Atkinson-Shiffrin memory model.
Official code implementation of Vary, a method for scaling up the vision vocabulary of large vision language models.
SOLO and SOLOv2 are instance segmentation models for computer vision tasks, built with PyTorch.
BrushNet is a plug-and-play image inpainting model with decomposed dual-branch diffusion, presented at ECCV 2024.
A PyTorch-based framework for reproducible deep learning studies with 26 knowledge distillation methods.
Code and models for Temporal Segment Networks (TSN) for action recognition in video understanding.
Get weekly updates on trending AI coding tools and projects.