Showing 861-880 of 2,275 projects
A simple BiLSTM-CRF model for named entity recognition in Chinese text, built using TensorFlow.
A collection of scripts and utilities for the Cityscapes Dataset, a popular dataset for computer vision tasks.
A collection of 200+ flashcards covering topics in machine learning, computer vision, and computer science.
Swin-Unet is a pure transformer-based model for medical image segmentation, with potential use in AI-powered coding tools.
A strong baseline and bag of tricks for deep person re-identification in computer vision.
This repository provides a tutorial and code for implementing the YOLO v3 object detection algorithm from scratch using PyTorch.
TengineKit is a fast and easy-to-use SDK for real-time face detection, face landmarks, face attributes, hand detection, body detection, and more on mobile devices.
A dataset of annotated 3D object videos for training computer vision and augmented reality models.
A GUI tool for image upscaling using the ESRGAN deep learning model.
A collection of pre-trained, state-of-the-art AI models for the ailia SDK, supporting a wide range of computer vision and natural language processing tasks.
A Python library for dense prediction transformers, a type of neural network architecture for computer vision and language tasks.
A curated collection of ICCV conference papers, code, and interpretations for computer vision developers.
This is an open-source implementation of a multimodal instruction-based editing and generation model.
A fast and robust C++ library for 3D point cloud registration, useful for robotics and SLAM applications.
Open source drivers for the Kinect for Windows v2 device, a powerful depth sensor for computer vision and robotics.
Turn any computer or edge device into a command center for your computer vision projects.
An open-source library for deep learning on satellite and aerial imagery using Python and PyTorch.
A video foundation model and dataset for multimodal understanding and video understanding tasks.
A C++ library for semi-direct visual odometry, a technique for estimating camera motion from image data.
GLIGEN is an open-source library for grounded text-to-image generation, enabling developers to create AI-powered image generation models.
Get weekly updates on trending AI coding tools and projects.