Open source N-dimensional image processing library for scientific computing and computer vision.
Comprehensive overview of Japanese Large Language Models (LLMs) for developers interested in generative AI.
A curated list of resources for leveraging visual information in large vision-language models (LVLMs) for complex reasoning, planning, and generation.
Continuous 3D Perception Model with Persistent State, a Python library for 3D computer vision tasks.
Real-time facial emotion detection using deep learning and computer vision.
Python materials for a free online course on learning and using OpenCV, a popular computer vision library.
Real-time head pose estimation using ONNX and OpenCV for computer vision and AI applications.
An open-source AI-powered computer vision model for object detection, segmentation, and understanding.
Real-time camera monitoring with a vision-language model (VLM), built with TypeScript and React.
All-in-one training for vision models with pretraining, fine-tuning, and distillation capabilities.
Official codebase for OMG-LLaVA and OMG-Seg, state-of-the-art computer vision models presented at CVPR 2024 and NeurIPS 2024.
A collection of recent Transformer-based computer vision and related research papers.
A strong, open-source vision-language assistant for mobile devices.
Official implementation of a paper on estimating geometry in the presence of motion, likely useful for computer vision and 3D applications.
A free and open-source tool that converts images to 3D parallax effect videos using AI depth estimation.
A deep learning-based face detection and recognition library implemented in PyTorch for developers working with computer vision.
Official implementation of a CVPR 2023 paper on handwriting generation using disentangled AI models.
An open-source framework for object counting systems using TensorFlow and Keras.
A curated list of resources related to person re-identification, a computer vision task to identify people across different cameras or scenarios.
A fast and efficient computer vision library for edge devices, supporting face, head, pedestrian, and vehicle detection.