Showing 1-3 of 3 projects
A video foundation model and dataset for multimodal understanding and video understanding tasks.
A self-supervised video representation learning model for video understanding tasks.
A PyTorch implementation of a BERT-style pretraining method for convolutional networks, enabling more efficient self-supervised learning.
Get weekly updates on trending AI coding tools and projects.