Showing 1881-1900 of 2,275 projects
A research project for Mesh R-CNN, a deep learning model for 3D object detection and segmentation.
A Python library for connecting Gaussian splatting and depth in computer vision tasks like monocular depth estimation and view synthesis.
A Darknet-based text detection and OCR library for computer vision developers.
A Rust library for image similarity comparison, simulating human perception using multiscale SSIM.
Automatically find issues in image datasets and practice data-centric computer vision.
A PyTorch library implementing the 2D Discrete Wavelet Transform and the Dual-Tree Complex Wavelet Transform, useful for AI and signal processing work.
A comprehensive guide for best practices in single-cell RNA sequencing analysis.
A comprehensive survey of deep learning-based image fusion techniques for computer vision applications.
A Python library for deep learning-based object pose estimation, with ROS inference capabilities.
This Jupyter Notebook series covers the fundamentals of NLP and Computer Vision, leading to cutting-edge Vision-Language Models.
A tutorial showing how to set up TensorFlow's Object Detection API on the Raspberry Pi for computer vision projects.
SlimYOLOv3 is a narrower, faster object detection model, derived from YOLOv3 by channel pruning, for real-time applications on UAVs.
This is an unofficial fork of OpenVSLAM, a C++ library for Visual SLAM (Simultaneous Localization and Mapping).
An open-source implementation of a state-of-the-art unsupervised image denoising algorithm for developers working with computer vision.
A curated collection of recent advances in vision-language pretrained models (VL-PTMs) for AI and multimodal applications.
Minimal solvers for calibrated camera pose estimation, useful for computer vision applications.
A robust dense feature matcher for estimating pixel-dense warps and reliable certainties between image pairs.
Open-source implementation of the YOLOv5 object detection model in PyTorch for training custom models.
A high-quality lip sync tool using deep learning techniques like GFPGAN and Wav2Lip.
A C# framework for computer vision, artificial intelligence, and other research-oriented tasks.