Explore Projects

Discover 32 open source projects

Active filters (1):
Search: vision-transformerร—
Clear all

Showing 1-20 of 32 projects

open-mmlab/mmdetection

Object detection toolbox for PyTorch with support for multiple tasks and state-of-the-art models.

32.4K
Archived
Python
Computer Vision
PyTorch
#object-detection#instance-segmentation#computer-vision

lukas-blecher/LaTeX-OCR

A deep learning model that converts images of mathematical equations into LaTeX code.

16.2K
Archived
Python
Computer Vision
PyTorch
#ocr#latex#math

jacobgil/pytorch-grad-cam

Advanced AI Explainability library for computer vision models built with PyTorch.

12.7K
Experimental
Python
Computer Vision
PyTorch
#computer-vision#explainable-ai#grad-cam

NielsRogge/Transformers-Tutorials

This repository contains demos for the Transformers library by HuggingFace, a popular NLP and computer vision library.

11.5K
Active
Jupyter Notebook
LLM Frameworks
PyTorch
#transformers#bert#gpt-2

FoundationVision/VAR

An ultra-simple, state-of-the-art codebase for autoregressive image generation using advanced AI models.

8.6K
Stable
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#autoregressive-models#diffusion-models#image-generation

adithya-s-k/omniparse

A Python library for ingesting, parsing, and optimizing any data format for enhanced compatibility with GenAI frameworks.

6.8K
Stable
Python
LLM Frameworks
File Storage
Python
#ingestion-api#ocr#parser-library

JingyunLiang/SwinIR

An open-source project that provides a state-of-the-art image restoration model using the Swin Transformer architecture.

5.4K
Archived
Python
Computer Vision
Backend Frameworks
Python
#image-restoration#super-resolution#computer-vision

huawei-noah/Efficient-AI-Backbones

Efficient AI model backbones developed by Huawei's Noah's Ark Lab, including GhostNet, TNT, and MLP.

4.4K
Experimental
Python
Computer Vision
Model Compression
PyTorch
#convolutional-neural-networks#efficient-inference#ghostnet

open-mmlab/mmpretrain

A pre-training toolbox and benchmark for vision AI models, including self-supervised learning and state-of-the-art architectures.

3.8K
Archived
Python
Computer Vision
ML Ops
PyTorch
#computer-vision#self-supervised-learning#pre-training

google-research/scenic

JAX library for computer vision research with transformers, attention mechanisms, and vision models

3.8K
Active
Python
Computer Vision
LLM Frameworks
JAX
#vision-transformer#computer-vision-research#jax-library

towhee-io/towhee

A fast and simple framework for building neural data processing pipelines using Python.

3.5K
Archived
Python
LLM Frameworks
Computer Vision
Python
#machine-learning#computer-vision#embeddings

mit-han-lab/efficientvit

Efficient vision foundation models for high-resolution generation and perception.

3.3K
Stable
Python
Computer Vision
ML Ops
Python
#deep-learning#computer-vision#image-generation

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#multimodal

OpenGVLab/InternVideo

A video foundation model and dataset for multimodal understanding and video understanding tasks.

2.2K
Stable
Python
Computer Vision
Datasets
PyTorch
#video-understanding#multimodal#foundation-models

hila-chefer/Transformer-Explainability

Official PyTorch implementation for a novel method to visualize classifications by Transformer based networks.

2.0K
Archived
Jupyter Notebook
Computer Vision
Documentation
PyTorch
#attention-visualization#explainability#computer-vision

ViTAE-Transformer/ViTPose

A simple vision transformer baseline for human pose estimation, with pre-trained models and advanced capabilities.

2.0K
Stable
Python
Computer Vision
Frontend Frameworks
PyTorch
#pose-estimation#vision-transformer#deep-learning

alibaba/EasyCV

An all-in-one computer vision toolkit for developers building AI-powered applications.

1.9K
Experimental
Python
Computer Vision
CLI Tools
PyTorch
#computer-vision#object-detection#self-supervised-learning

microsoft/Cream

A collection of Microsoft's work on NAS and Vision Transformer for efficient AI models.

1.8K
Archived
Python
Computer Vision
ML Ops
Python
#automl#efficiency#knowledge-distillation

MCG-NJU/VideoMAE

A self-supervised video representation learning model for video understanding tasks.

1.7K
Archived
Python
Computer Vision
API Frameworks
PyTorch
#video-analysis#video-understanding#self-supervised-learning

emcf/thepipe

A Python library that helps developers extract structured data from tricky documents using vision-language models.

1.5K
Stable
Python
LLM Frameworks
ETL & Pipelines
Python
#document-processing#large-language-models#multimodal
2

Stay in the loop

Get weekly updates on trending AI coding tools and projects.