Explore Projects

Discover 32 open source projects

Active filters (1):

Search: vision-transformer×

Clear all

Showing 21-32 of 32 projects

JingyunLiang/VRT

A video restoration transformer for deblurring, denoising, and super-resolution of videos.

1.5K

Archived

Python

Computer Vision

Backend Frameworks

PyTorch

#video-restoration#deblurring#denoising

czczup/ViT-Adapter

A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.

1.5K

Experimental

Python

Computer Vision

Backend Frameworks

PyTorch

#vision-transformer#object-detection#semantic-segmentation

lightly-ai/lightly-train

All-in-one training for vision models with pretraining, fine-tuning, and distillation capabilities.

1.3K

Active

Python

Computer Vision

Fine-tuning

PyTorch

#computer-vision#deep-learning#pretrained-models

DirtyHarryLYL/Transformer-in-Vision

A collection of recent Transformer-based computer vision and related research papers.

1.3K

Archived

Computer Vision

Vision Transformers

PyTorch

#computer-vision#deep-learning#transformer

fahadshamshad/awesome-transformers-in-medical-imaging

A curated collection of resources on applying Transformers to medical imaging tasks like segmentation, classification, and synthesis.

1.3K

Archived

Computer Vision

Tutorials & Courses

#medical-imaging#transformers#computer-vision

pprp/awesome-attention-mechanism-in-cv

A curated list of attention modules and plug-and-play modules for computer vision in Python.

1.3K

Archived

Python

Computer Vision

PyTorch

#attention-mechanisms#computer-vision#implementation

yitu-opensource/T2T-ViT

A Tokens-to-Token Vision Transformer (T2T-ViT) model for training Vision Transformers from scratch on ImageNet.

1.2K

Archived

Jupyter Notebook

Computer Vision

Frontend Frameworks

Jupyter Notebook

#t2t-transformer#vision-transformer#vit

NVlabs/VoxFormer

Official PyTorch implementation of VoxFormer, a state-of-the-art 3D computer vision model for autonomous driving and scene understanding.

1.2K

Archived

Python

Computer Vision

3D Perception

PyTorch

#3d-vision#autonomous-driving#scene-understanding

uncbiag/Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks, useful for vibe coders building AI-powered applications.

1.1K

Experimental

LLM Frameworks

Multimodal Models

#foundation-models#large-language-models#multimodal-models

jacobgil/vit-explain

A library for explaining the decisions made by Vision Transformers, a type of AI model used for computer vision tasks.

1.1K

Archived

Python

Explainable AI

ML Ops

PyTorch

#explainable-ai#vision-transformer#computer-vision

OFA-Sys/ONE-PEACE

A general representation model for cross-modal learning across vision, audio, and language.

1.1K

Archived

Python

LLM Frameworks

Representation Learning

Python

#multimodal#contrastive-learning#foundation-models

WangLibo1995/GeoSeg

A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery.

1.0K

Archived

Python

Computer Vision

Backend Frameworks

PyTorch

#computer-vision#semantic-segmentation#remote-sensing

Stay in the loop

Get weekly updates on trending AI coding tools and projects.