A Python library for benchmarking multiple object tracking (MOT) algorithms in computer vision applications.
Real-time AI assistant for Ray-Ban smart glasses using vision, voice, and agentic actions via Gemini Live.
A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.
An open-source image annotation tool for computer vision tasks.
A powerful 3D raytracing engine for creating photorealistic images and animations.
A C++ library for pixel-perfect structure-from-motion with featuremetric refinement, awarded best student paper at ICCV 2021.
A general NeRF acceleration toolbox in PyTorch for efficient neural rendering and computer vision.
Open-source foundational library for training deep learning models with PyTorch.
A simple baseline for 3D human pose estimation using TensorFlow, presented at ICCV 2017.
ColiVara is a high-performance document retrieval system that uses vision models instead of text processing.
An open-source tool for quickly annotating and labeling images for computer vision and deep learning projects.
An implementation for detailed localized image and video captioning using large multimodal models.
Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.
StableVideo is a Python library for text-driven, consistency-aware diffusion-based video editing, presented at ICCV 2023.
A PyTorch implementation of the EfficientDet object detection model for high-performance computer vision tasks.
Lazyeat is a touch-free controller for use while eating: pause a video, toggle full screen, or switch videos just by gesturing at the camera.
A photo-realistic image colorization library using dual decoders, powered by PyTorch.
A collection of papers and code on vision-based robotic grasping, at the intersection of computer vision and robotics.
A PyTorch implementation of image classification models for popular datasets like CIFAR-10, ImageNet, and more.
A C++ implementation of Halcon's shape-based matching algorithm for computer vision applications.