Explore Projects

Discover 32 open source projects

Active filters (1):
Search: vision-transformerร—
Clear all

Showing 21-32 of 32 projects

JingyunLiang/VRT

A video restoration transformer for deblurring, denoising, and super-resolution of videos.

1.5K
Archived
Python
Computer Vision
Backend Frameworks
PyTorch
#video-restoration#deblurring#denoising

czczup/ViT-Adapter

A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.

1.5K
Experimental
Python
Computer Vision
Backend Frameworks
PyTorch
#vision-transformer#object-detection#semantic-segmentation

lightly-ai/lightly-train

All-in-one training for vision models with pretraining, fine-tuning, and distillation capabilities.

1.3K
Active
Python
Computer Vision
Fine-tuning
PyTorch
#computer-vision#deep-learning#pretrained-models

DirtyHarryLYL/Transformer-in-Vision

A collection of recent Transformer-based computer vision and related research papers.

1.3K
Archived
Computer Vision
Vision Transformers
PyTorch
#computer-vision#deep-learning#transformer

fahadshamshad/awesome-transformers-in-medical-imaging

A curated collection of resources on applying Transformers to medical imaging tasks like segmentation, classification, and synthesis.

1.3K
Archived
Computer Vision
Tutorials & Courses
#medical-imaging#transformers#computer-vision

pprp/awesome-attention-mechanism-in-cv

A curated list of attention modules and plug-and-play modules for computer vision in Python.

1.3K
Archived
Python
Computer Vision
PyTorch
#attention-mechanisms#computer-vision#implementation

yitu-opensource/T2T-ViT

A Tokens-to-Token Vision Transformer (T2T-ViT) model for training Vision Transformers from scratch on ImageNet.

1.2K
Archived
Jupyter Notebook
Computer Vision
Frontend Frameworks
Jupyter Notebook
#t2t-transformer#vision-transformer#vit

NVlabs/VoxFormer

Official PyTorch implementation of VoxFormer, a state-of-the-art 3D computer vision model for autonomous driving and scene understanding.

1.2K
Archived
Python
Computer Vision
3D Perception
PyTorch
#3d-vision#autonomous-driving#scene-understanding

uncbiag/Awesome-Foundation-Models

A curated list of foundation models for vision and language tasks, useful for vibe coders building AI-powered applications.

1.1K
Experimental
LLM Frameworks
Multimodal Models
#foundation-models#large-language-models#multimodal-models

jacobgil/vit-explain

A library for explaining the decisions made by Vision Transformers, a type of AI model used for computer vision tasks.

1.1K
Archived
Python
Explainable AI
ML Ops
PyTorch
#explainable-ai#vision-transformer#computer-vision

OFA-Sys/ONE-PEACE

A general representation model for cross-modal learning across vision, audio, and language.

1.1K
Archived
Python
LLM Frameworks
Representation Learning
Python
#multimodal#contrastive-learning#foundation-models

WangLibo1995/GeoSeg

A UNet-like transformer for efficient semantic segmentation of remote sensing urban scene imagery.

1.0K
Archived
Python
Computer Vision
Backend Frameworks
PyTorch
#computer-vision#semantic-segmentation#remote-sensing
1

Stay in the loop

Get weekly updates on trending AI coding tools and projects.