Showing 1-3 of 3 projects
Open source implementation of CLIP, a contrastive learning model for multi-modal tasks like zero-shot classification.
A collection of tutorials and notebooks on state-of-the-art computer vision models and techniques for developers.
A video foundation model and dataset for multimodal understanding and video understanding tasks.
Get weekly updates on trending AI coding tools and projects.