A Python library for 6-DoF tracking and 3D reconstruction of unknown objects using neural networks.
This repository provides an official implementation of a CVPR 2023 paper on handwriting generation using disentangled AI models.
An academic alternative to Tesla's occupancy network for autonomous driving, built with Python.
Code for generating images from scene graphs.
Image segmentation using text and image prompts.
A novel neural operator called Involution that can be used for image classification, object detection, and other computer vision tasks.
A PyTorch library for joint discriminative and generative learning for person re-identification.
A truncated diffusion model for real-time end-to-end autonomous driving.
A 3D-informed video generation model with precise camera control for high-quality, consistent video content.
Universal instance perception model for object detection, segmentation, and tracking in videos.
Official code for DragDiffusion, a CVPR 2024 highlight paper on AI-powered image editing.
A CVPR 2025 video diffusion model that enables fast autoregressive video generation from slow bidirectional models.
This repository contains a PyTorch implementation of GIRAFFE, a generative model for representing 3D scenes as compositional neural feature fields.
A PyTorch library implementing the SRGAN super-resolution model for image processing.
A diffusion-based video generation model for trajectory-oriented video synthesis.
Open-source 3D reconstruction library for scalable and generalizable 3D reconstruction from image pairs.
Official implementation of a CVPR 2024 paper on deformable 3D Gaussians for monocular dynamic scene reconstruction.
A 3D reconstruction library for creating realistic virtual humans with AI-powered normal integration.
Unsupervised learning of symmetric 3D objects from images, for computer vision and 3D reconstruction.
Scaffold-GS is a 3D rendering library that uses structured Gaussian splatting for view-adaptive reconstruction.