A dataset of annotated 3D object videos for training computer vision and augmented reality models.
A collection of pre-trained, state-of-the-art AI models for the ailia SDK, supporting a wide range of computer vision and natural language processing tasks.
A Python library for dense prediction transformers, a neural network architecture for dense computer vision tasks such as depth estimation and semantic segmentation.
A curated collection of ICCV conference papers, code, and interpretations for computer vision developers.
Open source drivers for the Kinect for Windows v2 device, a powerful depth sensor for computer vision and robotics.
Turn any computer or edge device into a command center for your computer vision projects.
An open-source library for deep learning on satellite and aerial imagery using Python and PyTorch.
A video foundation model and dataset for multimodal video understanding tasks.
A Java library for Android that provides computer vision capabilities like object detection, tracking, and face recognition.
A prompt learning framework for vision-language models.
A curated list of image and video inpainting resources for developers working on computer vision and media processing applications.
A Python library that converts images of LaTeX math equations into LaTeX code using computer vision and deep learning.
SparseML provides a library for applying sparsification recipes to neural networks with a few lines of code, enabling faster and smaller models.
A handwritten text recognition system implemented using TensorFlow for developers working with computer vision and OCR.
An open-source Python library for generating synthetic text images, useful for computer vision tasks.
A GPU-accelerated MediaPipe plugin for the TouchDesigner creative coding platform, enabling advanced computer vision capabilities.
A Go-based, AI-powered tool that uses LLMs and LLM vision (OCR) to digitize documents.
An effective paradigm for building tiny-scale vision-language-action models for robotics and embodied AI.
A library for differentiable nonlinear optimization, useful for computer vision and robotics tasks.
A Python library, highlighted at CVPR'24, for building Gaussian Splatting SLAM systems for robotics.