A PyTorch-based computer vision foundation model with deformable convolutions for object detection and segmentation.
This repository contains the CVPR 2021 conference papers, which may be of interest to computer vision and AI developers.
FoundationStereo is a CVPR 2025 Best Paper Nomination project for zero-shot stereo matching using AI.
A highly efficient transformer-based model for high-resolution image restoration tasks such as deblurring, deraining, and denoising.
A PyTorch implementation of a deep learning-based video inpainting algorithm.
A prompt learning framework for vision-language models.
Open-source Python library for generating synthetic text images, useful for computer vision tasks.
A Python library, highlighted at CVPR'24, for building Gaussian Splatting SLAM systems for robotics.
Official PyTorch implementation of a novel method for visualizing classifications made by Transformer-based networks.
Custom Diffusion: A Python implementation of text-to-image diffusion models for computer vision applications.
A Python-based library for 3D full-head synthesis in 360-degree images, focusing on computer vision and AI.
A Python library for working with large multi-modal models, focusing on image resolution and text labeling.
Magma is a foundation model for building multimodal AI agents, enabling next-gen AI applications.
Collection of CVPR and NeurIPS poster examples and templates for AI/ML researchers and developers.
A research project from Facebook that explores multimodal AI models for computer vision and language tasks.
A library for 3D object proposal generation and detection from point cloud data, useful for computer vision.
This repository provides a real-time 4D view synthesis system that can generate 4K resolution outputs.
A Vue-based chat application with real-time functionality.
A PyTorch-based library for consistent depth estimation in super-long videos using transformers.
Open-source end-to-end vision-language-action model for GUI agents and computer usage analysis.