Showing 2081-2100 of 2,275 projects
An object detection framework that bridges the gap between anchor-based and anchor-free detection models.
A Keras port of the Single Shot MultiBox Detector (SSD) for object detection in computer vision.
This repository contains information about digital humans, likely focused on computer graphics and visualization.
VideoMamba is a state space model for efficient video understanding, focused on AI and machine learning.
A Python library for OCR text recognition using a CNN-based seq2seq model with visual attention, compatible with Google Cloud ML Engine.
A PyTorch implementation of the original Transformer model with interactive visualizations.
A book on SLAM (Simultaneous Localization and Mapping) that covers geometric methods and deep learning approaches.
A PyTorch implementation of Temporal Segment Networks (TSN) for video understanding and action recognition.
Implementation of SegNet, a deep convolutional encoder-decoder for semantic pixel-wise labeling.
A COVID-19 research project that uses molecular dynamics simulations to understand the SARS-CoV-2 virus.
Aria is an open-source multimodal AI framework for building vision and language models.
AnomalyGPT is a powerful tool for detecting industrial anomalies using large vision-language models.
HYPIR is a Python library for image restoration and super-resolution using diffusion-based priors.
TableBank is a benchmark dataset for table detection and recognition, useful for building computer vision models.
Powerful SOTA computer vision model for various image enhancement tasks like denoising, deblurring, and more.
Official implementation of an ultra-fast single-view 3D reconstruction tool built with Python.
An open-source OCR engine developed by SYSU DeepDriving Lab, focused on computer vision tasks.
A Python library that implements the YOLOv3 object detection algorithm using MobileNetV2 and ASFF.
An AI-powered anime super-resolution tool built for developers who work with computer vision models.
A Python library that generates 3D textured meshes from text prompts using 2D text-to-image models.
Get weekly updates on trending AI coding tools and projects.