Showing 281-300 of 2,275 projects
Open-source platform for building low-latency vision AI agents using any model or video provider
This repository contains an efficient implementation of a vision encoding model for vision-language models.
A Python library for audio-based lip synchronization in talking head video editing.
A TensorFlow implementation of Deep Convolutional Generative Adversarial Networks (DCGAN) for building AI-powered generative models.
A modern computer vision library written in C++ for efficient computer vision tasks.
A real-time approach for mapping 2D images to a 3D surface-based model of the human body.
Draco is a C++ library for compressing and decompressing 3D geometric meshes and point clouds.
Real-time high-resolution background matting tool for computer vision and ML applications.
A Keras-based implementation of the YOLOv3 object detection model for the TensorFlow backend.
A cross-platform image super-resolution tool built with TypeScript, Vue3, and PyTorch.
OptiScaler bridges upscaling/frame gen across GPUs, supporting DLSS2+, XeSS, FSR2+ and more.
Open-source script for improving object detection models, useful for vibe coders building AI-powered apps.
A Python library for 3D photography using context-aware layered depth inpainting.
A Python library for adapting the Segment Anything Model for zero-shot visual tracking with motion-aware memory.
DUSt3R is a Python library that makes it easy to work with geometric 3D vision tasks.
A unified framework for 3D content generation, focused on AI-powered 3D creation tools.
A Python framework for creating AI agents that can learn to play any game you own
A PyTorch library for synthesizing and manipulating high-resolution 2048x1024 images using conditional GANs.
Officially maintained deep learning models by PaddlePaddle, covering computer vision, NLP, speech, and more.
A flexible and interactive tool for video object tracking and segmentation, powered by AI models.
Get weekly updates on trending AI coding tools and projects.