Showing 1-6 of 6 projects
An official implementation of the Swin Transformer, a hierarchical vision transformer for image classification and segmentation.
An official PyTorch implementation of a deep learning model for human pose estimation
A PyTorch tutorial for building an image captioning model using the Show, Attend, and Tell technique.
A computer vision library for training and deploying deep learning models, with support for popular datasets and tasks.
Bottom-up attention model for image captioning and visual question answering, built on Faster R-CNN and Visual Genome.
A PyTorch toolkit for 2D Human Pose Estimation, useful for developers working on computer vision and AI-powered applications.
Get weekly updates on trending AI coding tools and projects.