Explore Projects

Discover 1 open source projects

Active filters (1):
Search: multimodal-datasetsร—
Clear all

Showing 1-1 of 1 projects

salesforce/LAVIS

LAVIS is a comprehensive library for multimodal deep learning, including image captioning, visual question answering, and more.

11.2K
Archived
Jupyter Notebook
Vision-Language Transformer
PyTorch
#deep-learning#multimodal-learning#vision-language

Stay in the loop

Get weekly updates on trending AI coding tools and projects.