Computer Vision

Explore 2,275 open source projects in Computer Vision

Showing 1361-1380 of 2,275 projects

tinyvision/SOLIDER

A self-supervised learning framework for learning general human representations from unlabeled images.

1.5K
Archived
Python
Computer Vision
Research Papers
PyTorch
#computer-vision #self-supervised-learning #human-representation

evo-design/evo

A project for biological foundation modeling from molecular to genome scale.

1.5K
Stable
Jupyter Notebook
Computer Vision
Databases
#biology #modeling #jupyter-notebook

tianrun-chen/SAM-Adapter-PyTorch

Adapts Meta AI's Segment Anything model to downstream tasks using adapters and prompts.

1.5K
Stable
Python
Computer Vision
Fine-tuning
PyTorch
#2d-segmentation #image-segmentation #segment-anything

chainer/chainercv

ChainerCV is a Python library for deep learning in computer vision tasks such as object detection and segmentation.

1.5K
Archived
Python
Computer Vision
API Frameworks
Python
#computer-vision #deep-learning #neural-network

luuuyi/CBAM.PyTorch

Unofficial PyTorch implementation of CBAM (Convolutional Block Attention Module), an attention module for convolutional neural networks.

1.5K
Archived
Python
Computer Vision
PyTorch
#computer-vision #neural-networks #attention-mechanism

speedinghzl/CCNet

CCNet is a semantic segmentation model that uses criss-cross attention to improve scene parsing performance.

1.5K
Archived
Python
Computer Vision
Backend Frameworks
PyTorch
#semantic-segmentation #scene-parsing #self-attention

ruotianluo/ImageCaptioning.pytorch

A PyTorch codebase for developing image captioning models.

1.5K
Archived
Python
Computer Vision
PyTorch
#machine-learning #computer-vision #image-captioning

cheind/py-motmetrics

A Python library for benchmarking multiple object trackers (MOT).

1.5K
Experimental
Python
Computer Vision
Testing
#benchmark #object-detection #object-tracking

sseanliu/VisionClaw

Real-time AI assistant for Ray-Ban smart glasses using vision, voice, and agentic actions via Gemini Live.

1.5K
Active
Agents & Orchestration
Computer Vision
Gemini Live
#smart glasses #vision agent #real-time AI

fastplotlib/fastplotlib

A next-gen fast plotting library for scientific visualization running on WGPU and the pygfx rendering engine.

1.5K
Active
Python
Charts & Visualization
CLI Tools
Python
#fast-visualization #gpu #interactive-visualizations

czczup/ViT-Adapter

A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.

1.5K
Experimental
Python
Computer Vision
Backend Frameworks
PyTorch
#vision-transformer #object-detection #semantic-segmentation

faustomorales/keras-ocr

A flexible Python library for optical character recognition (OCR) using the CRAFT text detector and Keras CRNN recognition model.

1.5K
Stable
Python
Computer Vision
API Frameworks
Keras
#ocr #text-detection #keras-crnn

deepwel/Chinese-Annotator

An open-source tool for annotating Chinese text corpora, useful for NLP and text analysis projects.

1.5K
Archived
JavaScript
Computer Vision
Search
React
#nlp #text-annotation #chinese

robocorp/rpaframework

Open-source RPA framework for Python and Robot Framework, focused on automation and AI-powered document processing.

1.5K
Stable
Python
Computer Vision
OCR
Robot Framework
#automation #rpa #robotframework

thu-ml/unidiffuser

An open-source library for training and running state-of-the-art diffusion models in Python.

1.5K
Archived
Python
LLM Frameworks
ML Ops
Python
#diffusion-models #computer-vision #machine-learning

gaoxiang12/faster-lio

Lightweight, tightly coupled lidar-inertial odometry using parallel sparse incremental voxels.

1.5K
Stable
C++
Computer Vision
#lidar #inertial-odometry #voxels

baaivision/Emu3.5

A Python library that provides native multimodal models for building world-learning AI systems.

1.5K
Stable
Python
LLM Frameworks
Agents & Orchestration
Python
#multimodal #world-learning #agents

peteanderson80/bottom-up-attention

Bottom-up attention model for image captioning and visual question answering, built on Faster R-CNN and Visual Genome.

1.5K
Archived
Jupyter Notebook
Computer Vision
ML Ops
Caffe
#image-captioning #visual-question-answering #faster-rcnn

sxyu/pixel-nerf

PixelNeRF is a Python library for training and using NeRF models for neural volumetric rendering.

1.5K
Archived
Python
Computer Vision
API Frameworks
Python
#neural-rendering #computer-vision #3d-reconstruction

CSAILVision/LabelMeAnnotationTool

An open-source image annotation tool for computer vision tasks.

1.5K
Archived
JavaScript
Computer Vision
Component Libraries (React)
React
#annotation #computer-vision #image-labeling