Showing 421-440 of 848 projects
A Java-based computer vision algorithm inference framework used in the developer's projects.
A tutorial on using large language models (LLMs) and vision language models (VLMs) for various AI-powered coding tasks.
Robust Stereo Visual Inertial Odometry for fast autonomous flight using ROS
This repository provides a collection of MATLAB and Simulink project ideas to help developers gain practical experience in AI, robotics, and more.
A CSS library that provides a text-based user interface (TUI) for building terminal-style web applications.
PyTorch implementation of the YOLOv4 object detection algorithm for developers interested in computer vision.
Code for Holistically-Nested Edge Detection, a C++ library for edge detection in computer vision.
Cross-modal lip reading using 3D convolutional neural networks for speech recognition.
Official code implementation of Vary, a method for scaling up the vision vocabulary of large vision language models.
An open-source library for efficient diffusion models in computer vision, with potential AI coding tool applications.
Recipes for shrinking, optimizing, and customizing cutting-edge computer vision models.
A powerful multimodal transformer for combining language, vision, and other modalities in AI applications.
CenterNet is a Python library for object detection using keypoint triplets, with applications in computer vision.
A high-fidelity 3D creation tool from a single image using diffusion models for vibe coders.
Official implementation of the PVT (Pyramid Vision Transformer) series, a backbone model for detection and segmentation tasks.
An open-source toolbox for action understanding based on PyTorch, focused on computer vision and video analysis.
Java and Kotlin code samples for Google Cloud Platform services like AppEngine, AutoML, and Vision API.
A comprehensive list of papers, code, and resources related to NeRF and 3D Gaussian Splatting for SLAM/Robotics applications.
A Keras port of the Single Shot MultiBox Detector, a deep learning-based object detection model.
An open-source project towards developing a GPT-4-based AI assistant with vision, speech, and duplex capabilities.
Get weekly updates on trending AI coding tools and projects.