Showing 41-60 of 95 projects
A powerful universal image segmentation model that can handle various segmentation tasks with a single transformer-based model.
A rigorous benchmark for evaluating the code quality and efficiency of large language models like GPT-4.
This is a native iOS app using the exposure notification framework from Apple, focused on COVID-19 contact tracing.
A repository for ImageReward, a learning and evaluating human preferences for text-to-image generation
A roadmap to become a Visual SLAM (Simultaneous Localization and Mapping) developer in 2023.
SmoothQuant is an efficient post-training quantization tool for large language models, enabling accurate and fast inference.
A high-performance text-to-3D generation model for building immersive 3D experiences with AI tools.
EasyVolcap is a Python library that accelerates neural volumetric video research by providing a simple and efficient workflow.
A powerful benchmark for Monte Carlo Tree Search in sequential decision-making scenarios.
Official implementation of a paper on a unified Transformer-based framework for object detection and segmentation.
This appears to be a repository for sharing information about the 2023 HVV event, not a developer discovery platform.
An open-vocabulary video segmentation model that can track any object in a video, for video editing and processing.
A PyTorch library that provides Vision Transformer (ViT) adapters for dense prediction tasks like object detection and semantic segmentation.
StableVideo is a Python library for text-driven, consistency-aware diffusion-based video editing, presented at ICCV 2023.
A photo-realistic image colorization library using dual decoders, powered by PyTorch.
One-stage Retinex-based Transformer for Low-light Image Enhancement
OpenDriveLab's Birds-eye-view Perception project provides a cookbook and research for autonomous driving, with Python implementation.
An efficient diffusion model for image super-resolution, with applications in AI-powered image processing.
A PyTorch implementation of MotionBERT, a unified approach for learning human motion representations.
Official implementation of X-Decoder, a generalized decoding model for pixel, image, and language tasks.
Get weekly updates on trending AI coding tools and projects.