Showing 21-40 of 848 projects
Multimodal AI agent stack for GUI and browser automation
Interactive deep learning book with code and math
Open-source data labeling tool for AI/ML projects
MiniGPT-4 and MiniGPT-v2 for vision-language tasks
Comprehensive resource for generative AI research, interviews, and courses
PyTorch implementation of CycleGAN and pix2pix for image-to-image translation
Image annotation tool for computer vision projects
LLaVA is a visual instruction tuning framework for large language and vision models, enabling GPT-4 level capabilities.
Parses GUI screenshots into structured elements for vision-based agents
On-device multimodal LLM for vision, speech, and live streaming on phones
PyTorch examples in vision, text, and reinforcement learning
Curated computer vision resources for developers
Learn OpenCV with C++ and Python examples for computer vision and AI
CVPR 2025 ่ฎบๆๅๅผๆบ้กน็ฎๅ้
AI-powered dataset management and preprocessing library for ML projects
3D Gaussian Splatting for real-time radiance field rendering
AI-powered browser automation for workflows
Arknights game automation with image recognition
Deep learning implementation of Dive into Deep Learning using PyTorch
An all-in-one script for installing and configuring various proxy servers and protocols.
Get weekly updates on trending AI coding tools and projects.