A robust and highly performant video matting library for PyTorch, TensorFlow, TensorFlow.js, ONNX, and CoreML.
A collection of tutorials and notebooks on state-of-the-art computer vision models and techniques for developers.
A comprehensive collection of resources for anomaly detection, including books, papers, videos, and toolboxes.
A PyTorch library with pre-trained convolutional networks for various computer vision tasks.
ImageBind is a multimodal learning framework that learns a single embedding space across six modalities: images, text, audio, depth, thermal, and IMU data.
An interactive visualization tool for learning Convolutional Neural Networks (CNNs) and deep learning.
A Python library that empowers developers to build applications and systems with self-contained computer vision capabilities.
Dolphin is a document image parsing library that uses heterogeneous anchor prompting for OCR and layout analysis.
A JavaScript library for animating 3D poses of human bodies using AI-powered algorithms.
NSFW detection on the client-side via TensorFlow.js for content moderation and filtering.
A Python library for generating 3D models from text or images using NeRF and Stable Diffusion.
ModelScope is an open-source AI framework built around the Model-as-a-Service concept, providing tools for building, deploying, and managing AI models.
A guide to deploying deep-learning inference networks and computer vision primitives with NVIDIA Jetson hardware and TensorRT.
A comprehensive collection of study materials for deep learning interviews, covering various topics like machine learning, computer vision, and NLP.
High-resolution photorealistic video-to-video translation powered by PyTorch.
An ultra-simple, state-of-the-art codebase for autoregressive image generation.
A comprehensive computer vision and robotics library for tasks like SLAM, object detection, and more.
A comprehensive SDK for Intel's RealSense depth cameras, enabling 3D computer vision applications.
A real-time and accurate full-body multi-person pose estimation and tracking system written in Python.
Lab materials for an introductory course on deep learning from MIT, covering computer vision, music generation, and more.