Showing 721-740 of 848 projects
An all-in-one data labeling and annotation platform for multimodal data training, supporting 3D LiDAR, images, and language models.
Kimi-VL is a multimodal AI model for advanced vision-language understanding and reasoning.
A PyTorch-based library for rendering 3D neural meshes, useful for computer vision and graphics applications.
A darknet-based text detection and OCR library for developers working with computer vision AI.
A Python library for connecting Gaussian splatting and depth in computer vision tasks like monocular depth estimation and view synthesis.
A voice-powered AI assistant that can answer questions about any application, in context and in audio.
Automatically find issues in image datasets and practice data-centric computer vision.
A comprehensive survey of deep learning-based image fusion techniques for computer vision applications.
This Jupyter Notebook series covers the fundamentals of NLP and Computer Vision, leading to cutting-edge Vision-Language Models.
A tutorial showing how to set up TensorFlow's Object Detection API on the Raspberry Pi for computer vision projects.
An open-source implementation of a state-of-the-art unsupervised image denoising algorithm for developers working with computer vision.
A curated collection of recent advances in vision-language pretrained models (VL-PTMs) for AI and multimodal applications.
Minimal solvers for calibrated camera pose estimation, useful for computer vision applications.
A C# framework for computer vision, artificial intelligence, and other research-oriented tasks.
A Python library that combines the latest version of YOLOv5 and DeepSort for object detection and tracking.
A vehicle detection project using machine learning and computer vision for self-driving cars.
Demo programs for the Talking Head Anime from a Single Image 2: More Expressive project, focused on AI and computer vision.
A Keras implementation of image outpainting, allowing developers to extend images beyond their borders.
A library for universal monocular metric depth estimation using computer vision techniques.
A collection of cool computer vision, learning, and graphics papers focused on cats.
Get weekly updates on trending AI coding tools and projects.