Showing 1-1 of 1 projects
An implementation for detailed localized image and video captioning using large multimodal models.
Get weekly updates on trending AI coding tools and projects.