Explore Projects

Discover 2 open source projects

Active filters (1):
Search: language-visionร—
Clear all

Showing 1-2 of 2 projects

salesforce/LAVIS

LAVIS is a comprehensive library for multimodal deep learning, including image captioning, visual question answering, and more.

11.2K
Archived
Jupyter Notebook
Vision-Language Transformer
PyTorch
#deep-learning#multimodal-learning#vision-language

unum-cloud/UForm

Multimodal AI toolkit for fast content understanding and generation across text, images, and video

1.2K
Stable
Python
LLM Frameworks
Computer Vision
PyTorch
#multimodal-ai#cross-modal#semantic-search

Stay in the loop

Get weekly updates on trending AI coding tools and projects.