Showing 1-6 of 6 projects
A comprehensive reading list for research topics in multimodal machine learning.
A prompt learning framework for vision-language models.
A curated list of research on multimodal learning, useful for developers working on AI-powered applications.
A CVPR 2024 and TPAMI 2025 AI-powered multimodal learning architecture for vibe coders.
An official implementation of CLIP4Clip, a model for end-to-end video clip retrieval.
A comparative framework for building multimodal recommender systems using collaborative filtering and matrix factorization.
Get weekly updates on trending AI coding tools and projects.