PaddleGAN is a high-performance GAN library for developers working with AI-powered applications like image editing, style transfer, and motion transfer.
A multi-object tracking library with modular state-of-the-art trackers for segmentation, detection, and pose estimation.
A foundation model for monocular depth estimation that leverages large-scale unlabeled data.
A Python library for upsampling images using a self-supervised generative model approach.
A multilingual document layout parsing model that can extract text, images, and structure from documents in a single vision-language model.
An open-source 3D model viewer library that enables interactive 3D visualizations on the web and in AR.
An open-source AI toolkit for medical imaging and healthcare applications built on PyTorch.
A faster PyTorch implementation of the Faster R-CNN object detection algorithm.
A Swift-based library for running Stable Diffusion AI models natively on Apple Silicon Macs.
An open-source face recognition system with a rich set of computer vision features and APIs.
A Python library for fast, high-quality monocular view synthesis, useful for computer vision and 3D applications.
Cartographer is a real-time SLAM system for 2D and 3D localization and mapping across multiple platforms and sensors.
An open-source Python tool that uses AI to remove backgrounds from images and videos with a simple command-line interface.
An implementation of the DreamBooth technique for fine-tuning Stable Diffusion models.
A modular ZK backend accelerated by GPU for building blockchain and cryptocurrency applications.
SPADE is a Python library for semantic image synthesis, enabling high-quality generation of images from semantic segmentation maps.
TensorRT implementations of popular deep learning networks for efficient inference on GPUs.
AlphaFold 3 is a Python-based inference pipeline for protein structure prediction using deep learning.
A highly capable foundation model for monocular depth estimation, a key component in computer vision.
This repository contains a diffusion model for generating expressive portrait videos from audio.