Showing 2061-2080 of 2,275 projects
Self-hosted open-source web gallery with AI-powered image discovery and tagging for photos and videos.
Real-time face tracking, expression detection, and animated emoticons for web applications.
Android document scanning library built on OpenCV, allowing cropping and perspective transformation of scanned documents.
Official PyTorch implementation of the NVAE deep hierarchical variational autoencoder for AI and ML developers.
A collection of datasets for deep learning with satellite and aerial imagery.
This project contains image processing algorithms written from scratch in Python and C++.
An unofficial Keras implementation of the Noise2Noise image denoising algorithm, useful for vibe coders working with AI-powered tools.
A simple API built with FastAPI and ddddocr for solving captchas, with Docker support.
Official implementation of SAM-Med2D, a tool for medical image segmentation using transformers.
An official implementation of a system for improving video understanding and generation with better captions.
A library for accelerating deep neural networks through channel pruning, a model compression technique.
Multi-camera live object tracking and traffic counting using YOLO v4, Deep SORT, and Flask.
A Python library for learning and using OpenCV, a popular computer vision library.
GMTalker is a 3D digital human system that integrates speech recognition, speech synthesis, natural language understanding, and mouth animation for fast deployment on Windows, Linux, and Android.
The spatial perception framework for rapidly building smart robots and spaces
A C++ library for real-time, rotation-invariant face detection using progressive calibration networks.
A powerful open-world object detection model for computer vision tasks, leveraging the DINO framework.
An open-source AI-powered face swap tool, focused on the Chinese market.
JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training Diffusion models.
A deep reinforcement learning library for training robotic agents to plan pushing and grasping actions for manipulation tasks.
Get weekly updates on trending AI coding tools and projects.