Stability-AI/sd3.5 is a Python library for running the Stable Diffusion 3.5 image generation model.
An open-source tool for quickly annotating and labeling images for computer vision and deep learning projects.
An implementation for detailed localized image and video captioning using large multimodal models.
A deep learning toolkit for medical image analysis, supporting a variety of neural network models and medical imaging tasks.
This GitHub repository provides a curated collection of resources for developing Multiple Object Tracking (MOT) applications.
A Tauri-based Windows face unlock app using Vue 3 and OpenCV for biometric authentication.
A scalable and efficient object detection library implemented in Keras and TensorFlow.
A Python library for semantic segmentation using a novel masking-based approach.
This Python project is a technical guide for beginners to study algorithms and software architectures for autonomous vehicle control.
PyTorch implementation of a multi-label image recognition model using graph convolutional networks.
Amica is an open-source interface for interactive communication with 3D characters using voice synthesis and recognition.
StableVideo is a Python library for text-driven, consistency-aware diffusion-based video editing, presented at ICCV 2023.
The official implementation of HigherHRNet, a scale-aware human pose estimation model.
A PyTorch implementation of the EfficientDet object detection model for high-performance computer vision tasks.
A PyTorch-based YOLOv4 and YOLOv5 implementation for detecting fire and smoke in images and videos.
A temporally consistent diffusion model for real-world video super-resolution and deflickering.
This repository contains the official implementation of the MobileCLIP and MobileCLIP2 research papers, focused on fast image-text models efficient enough to run on mobile devices.
A photo-realistic image colorization library using dual decoders, powered by PyTorch.
This repository contains papers and code related to vision-based robotic grasping, a field in computer vision and robotics.
A simple GUI for ByteDance's Piano Transcription with Pedals, packaged with Nix.