Explore Projects

Discover 848 open source projects

Active filters (1):
Search: visionร—
Clear all

Showing 441-460 of 848 projects

LMD0311/Awesome-World-Model

A collection of world models for autonomous driving and robotic papers

1.9K
Stable
Unknown
#world-models#autonomous-driving#robotics

NanoNets/docext

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit.

1.9K
Stable
Python
Computer Vision
API Frameworks
Python
#document-analysis#document-data-extraction#ocr-benchmark

A9T9/RPA

Open-source RPA software with computer vision, OCR, and integration with Anthropic's AI language model.

1.9K
Experimental
JavaScript
AI Coding Agents
MCP Frameworks
Selenium
#browser-automation#computer-vision#ocr

mpatacchiola/deepgaze

A computer vision library for human-computer interaction, including head pose and gaze estimation, skin detection, motion tracking, and more.

1.9K
Archived
Python
Computer Vision
CLI Tools
Python
#computer-vision#head-pose-estimation#gaze-detection

yassouali/awesome-semi-supervised-learning

A curated list of awesome papers, methods, and resources for semi-supervised learning, a powerful machine learning technique.

1.9K
Archived
Machine Learning
Tutorials & Courses
#machine-learning#deep-learning#semi-supervised-learning

JimmyHHua/opencv_tutorials

A comprehensive collection of OpenCV 4.0 tutorials with Python, suitable for developers working with computer vision.

1.9K
Archived
Python
Computer Vision
#opencv#computer-vision#python

probcomp/Gen.jl

A general-purpose probabilistic programming system for building AI and ML applications.

1.8K
Stable
Julia
Machine Learning
Robotics
Julia
#bayesian#computer-vision#deep-learning

QianMo/OpenCV3-Intro-Book-Src

This is the source code for a book on getting started with the OpenCV computer vision library in C++.

1.8K
Archived
C++
Books & Guides
Backend Frameworks
#opencv#computer-vision#c-plus-plus

GauravBh1010tt/DeepLearn

A comprehensive repository for implementing research papers on deep learning, NLP, and computer vision in Python.

1.8K
Archived
Python
Computer Vision
NLP
Keras
#deep-learning#computer-vision#natural-language-processing

wuxiaolang/Visual_SLAM_Related_Research

This repository contains research related to visual and semantic SLAM (Simultaneous Localization and Mapping) for developers working with computer vision and robotics.

1.8K
Archived
Computer Vision
Documentation
#mapping#semantic#slam

microsoft/Cream

A collection of Microsoft's work on NAS and Vision Transformer for efficient AI models.

1.8K
Archived
Python
Computer Vision
ML Ops
Python
#automl#efficiency#knowledge-distillation

xinshuoweng/AB3DMOT

An open-source Python implementation for 3D multi-object tracking, with KITTI benchmarking and new evaluation metrics.

1.8K
Archived
Python
Computer Vision
API Frameworks
#3d-object-tracking#computer-vision#robotics

AlibabaResearch/AdvancedLiterateMachinery

An innovative AI-powered document understanding and OCR platform from Alibaba Research.

1.8K
Experimental
C++
Computer Vision
Document Intelligence
#ocr#document-recognition#document-understanding

facebookresearch/MetaCLIP

A research project from Facebook that explores multimodal AI models for computer vision and language tasks.

1.8K
Stable
Python
LLM Frameworks
Computer Vision
PyTorch
#multimodal-ai#computer-vision#language-models

Introduction-to-Autonomous-Robots/Introduction-to-Autonomous-Robots

A book covering the fundamentals of autonomous robots, including robotics, computer vision, and control systems.

1.8K
Stable
TeX
Robotics
#robotics#autonomous-systems#computer-vision

yassouali/pytorch-segmentation

A PyTorch library for building state-of-the-art semantic segmentation models for computer vision tasks.

1.8K
Experimental
Jupyter Notebook
Computer Vision
API Frameworks
PyTorch
#computer-vision#deep-learning#semantic-segmentation

sshaoshuai/PointRCNN

A 3D object proposal generation and detection library from point cloud data, useful for computer vision.

1.8K
Archived
Python
Computer Vision
#computer-vision#3d-detection#point-cloud

Turbo1123/roubao

An Android automation tool based on vision-language models that allows developers to automate mobile app interactions.

1.8K
Active
Kotlin
Computer Vision
Android
Kotlin
#android-automation#vision-language-models#mobile-agents

microsoft/GlobalMLBuildingFootprints

A dataset of worldwide building footprints derived from satellite imagery for geospatial and computer vision applications.

1.8K
Experimental
Python
Computer Vision
#geospatial#satellite-imagery#computer-vision

google-deepmind/tapnet

A deep learning-based point tracking library for computer vision and robotics applications.

1.8K
Active
Jupyter Notebook
Computer Vision
Deep Learning
Jupyter Notebook
#computer-vision#point-tracking#deep-learning
1...2224...43

Stay in the loop

Get weekly updates on trending AI coding tools and projects.