A PyTorch implementation of a Scene Graph Generation method, with visualization and extraction capabilities.
Code for a deep learning-based instance segmentation model designed for real-time performance.
Official PyTorch implementation of VoxFormer, a state-of-the-art 3D computer vision model for autonomous driving and scene understanding.
A transformer-based vision model built on neighborhood attention, useful for computer vision tasks.
A Python library for generalized end-to-end probabilistic perspective-n-points for monocular object pose estimation.
A neural 3D mesh renderer library for creating photorealistic 3D scenes from images.
A Python library for connecting Gaussian splatting and depth in computer vision tasks like monocular depth estimation and view synthesis.
A robust dense feature matcher for estimating pixel-dense warps and reliable certainties between image pairs.
A MATLAB library for detecting tiny faces in images using a scale-invariant face detector.
A PyTorch library for calculating the Fréchet Inception Distance (FID) with proper image resizing and quantization steps.
A collection of research papers from the Computer Vision and Pattern Recognition (CVPR) conference in 2024.
A Jupyter Notebook project that explores techniques for reconstructing images from Stable Diffusion models.
Official PyTorch implementation of a state-of-the-art virtual try-on model for high-resolution clothing generation.
Efficient architectures for interactive conditional GANs, useful for image-to-image translation tasks.
A toolbox for spectral compressive imaging reconstruction with state-of-the-art algorithms.
SETR is a transformer-based approach that treats semantic segmentation as a sequence-to-sequence prediction task.
A fast and robust feature matching library for computer vision tasks like SfM and SLAM.
SLAM3R provides real-time dense scene reconstruction.
A state-of-the-art tool that uses diffusion models for high-quality 3D reconstruction.
A C++ library for real-time, rotation-invariant face detection using progressive calibration networks.