A collection of deep learning-based image and video colorization papers for developers interested in computer vision.
A curated list of foundation models for vision and language tasks, useful for vibe coders building AI-powered applications.
Gaussian-SLAM is a Python library for photo-realistic 3D reconstruction and SLAM using Gaussian splatting.
A powerful and easy-to-use toolkit for implementing various computer vision tasks on Android, including text recognition, barcode scanning, image labeling, face detection, and object detection.
A PyTorch library for calculating the Fréchet Inception Distance (FID) with proper image resizing and quantization steps.
A library for studying the robustness of computer vision models to various corruptions and perturbations.
A Python-based framework for building large language models for computer vision tasks.
A simple Python tool for labeling object bounding boxes in images, useful for computer vision tasks.
A curated collection of awesome unified multimodal models for text-to-image generation and vision-language tasks.
A collection of research papers from the Computer Vision and Pattern Recognition (CVPR) conference in 2024.
Video classification tools using 3D ResNet for action recognition and computer vision tasks.
A curated collection of image-to-image translation papers and their corresponding code.
A universal Flutter barcode and QR code scanner using popular computer vision libraries.
A curated list of research papers on visual grounding, a key technique for multimodal AI.
A computer vision curriculum and tutorials for developers interested in machine learning and AI.
An AI-powered tool for anime image inpainting and restoration, useful for computer vision and generative art projects.
An attention-based OCR library for building vision AI apps that extract text from images.
A PyTorch toolkit for 2D human pose estimation, useful for developers working on computer vision and AI-powered applications.
A TensorFlow-based rotation detection benchmark for computer vision and AI models.
A 3D reconstruction library with spatial memory for computer vision applications.