Showing 1-2 of 2 projects
An open-source project that enables developers to build chatbots with video understanding using large language models.
A curated list of research papers on visual grounding, a key technique for multimodal AI.
Get weekly updates on trending AI coding tools and projects.