Showing 441-460 of 848 projects
A collection of world models for autonomous driving and robotic papers
An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit.
Open-source RPA software with computer vision, OCR, and integration with Anthropic's AI language model.
A computer vision library for human-computer interaction, including head pose and gaze estimation, skin detection, motion tracking, and more.
A curated list of awesome papers, methods, and resources for semi-supervised learning, a powerful machine learning technique.
A comprehensive collection of OpenCV 4.0 tutorials with Python, suitable for developers working with computer vision.
A general-purpose probabilistic programming system for building AI and ML applications.
This is the source code for a book on getting started with the OpenCV computer vision library in C++.
A comprehensive repository for implementing research papers on deep learning, NLP, and computer vision in Python.
This repository contains research related to visual and semantic SLAM (Simultaneous Localization and Mapping) for developers working with computer vision and robotics.
A collection of Microsoft's work on NAS and Vision Transformer for efficient AI models.
An open-source Python implementation for 3D multi-object tracking, with KITTI benchmarking and new evaluation metrics.
An innovative AI-powered document understanding and OCR platform from Alibaba Research.
A research project from Facebook that explores multimodal AI models for computer vision and language tasks.
A book covering the fundamentals of autonomous robots, including robotics, computer vision, and control systems.
A PyTorch library for building state-of-the-art semantic segmentation models for computer vision tasks.
A 3D object proposal generation and detection library from point cloud data, useful for computer vision.
An Android automation tool based on vision-language models that allows developers to automate mobile app interactions.
A dataset of worldwide building footprints derived from satellite imagery for geospatial and computer vision applications.
A deep learning-based point tracking library for computer vision and robotics applications.
Get weekly updates on trending AI coding tools and projects.