Explore Projects

Discover 848 open source projects

Active filters (1):
Search: visionร—
Clear all

Showing 21-40 of 848 projects

bytedance/UI-TARS-desktop

Multimodal AI agent stack for GUI and browser automation

28.6K
Stable
TypeScript
MCP Servers
Agents & Orchestration
TypeScript
#agent-tars#multimodal-ai#gui-agent

d2l-ai/d2l-en

Interactive deep learning book with code and math

28.4K
Archived
Python
Computer Vision
Books & Guides
Jupyter
#deep-learning#machine-learning#computer-vision

HumanSignal/label-studio

Open-source data labeling tool for AI/ML projects

26.6K
Active
TypeScript
Computer Vision
Testing
#data-labeling#annotation-tool#computer-vision

Vision-CAIR/MiniGPT-4

MiniGPT-4 and MiniGPT-v2 for vision-language tasks

25.8K
Archived
Python
Computer Vision
LLM Wrappers & SDKs
PyTorch
#vision-language#llm-wrapper#computer-vision

aishwaryanr/awesome-generative-ai-guide

Comprehensive resource for generative AI research, interviews, and courses

25.1K
Active
HTML
LLM Wrappers & SDKs
Tutorials & Courses
#generative-ai#llms#interview-prep

junyanz/pytorch-CycleGAN-and-pix2pix

PyTorch implementation of CycleGAN and pix2pix for image-to-image translation

25.0K
Experimental
Python
Computer Vision
PyTorch
#image-generation#image-manipulation#generative-adversarial-networks

HumanSignal/labelImg

Image annotation tool for computer vision projects

24.8K
Archived
Python
Computer Vision
CLI Tools
#image-annotation#computer-vision#data-labeling

haotian-liu/LLaVA

LLaVA is a visual instruction tuning framework for large language and vision models, enabling GPT-4 level capabilities.

24.5K
Archived
Python
Computer Vision
LLM Frameworks
PyTorch
#llava#gpt-4#instruction-tuning

microsoft/OmniParser

Parses GUI screenshots into structured elements for vision-based agents

24.4K
Stable
Jupyter Notebook
Computer Vision
Agent Coordination
Jupyter Notebook
#computer-vision#gui-automation#vision-based-agents

OpenBMB/MiniCPM-o

On-device multimodal LLM for vision, speech, and live streaming on phones

24.0K
Active
Python
Inference
Local Inference Engines
llama.cpp-omni
#minicpm-o#multimodal-llm#on-device-ai

pytorch/examples

PyTorch examples in vision, text, and reinforcement learning

23.8K
Stable
Python
Computer Vision
Inference
PyTorch
#pytorch#machine-learning#computer-vision

jbhuang0604/awesome-computer-vision

Curated computer vision resources for developers

23.1K
Archived
Computer Vision
#computer-vision#resources#curated-list

spmallick/learnopencv

Learn OpenCV with C++ and Python examples for computer vision and AI

22.8K
Active
Jupyter Notebook
Computer Vision
Example Projects
#opencv#computer-vision#ai

amusi/CVPR2026-Papers-with-Code

CVPR 2025 ่ฎบๆ–‡ๅ’Œๅผ€ๆบ้กน็›ฎๅˆ้›†

22.0K
Experimental
Computer Vision
#cvpr2025#papers-with-code#computer-vision

huggingface/datasets

AI-powered dataset management and preprocessing library for ML projects

21.2K
Active
Python
ML Ops
ETL & Pipelines
HuggingFace
#datasets#ml-ops#data-preprocessing

graphdeco-inria/gaussian-splatting

3D Gaussian Splatting for real-time radiance field rendering

20.8K
Stable
Python
Computer Vision
#3d-gaussian-splatting#radiance-field#computer-graphics

Skyvern-AI/skyvern

AI-powered browser automation for workflows

20.7K
Active
Python
Agents & Orchestration
Computer Vision
Playwright
#ai-automation#browser-automation#computer-vision

MaaAssistantArknights/MaaAssistantArknights

Arknights game automation with image recognition

19.8K
Active
C++
Computer Vision
Agent Coordination
C++
#arknights#game-automation#computer-vision

ShusenTang/Dive-into-DL-PyTorch

Deep learning implementation of Dive into Deep Learning using PyTorch

19.3K
Archived
Jupyter Notebook
LLM Frameworks
Desktop Model Runners
PyTorch
#Deep Learning#PyTorch#Dive into Deep Learning

mack-a/v2ray-agent

An all-in-one script for installing and configuring various proxy servers and protocols.

19.3K
Active
Shell
API Frameworks
#proxy#server#configuration
13...43

Stay in the loop

Get weekly updates on trending AI coding tools and projects.