Showing 1-4 of 4 projects
Open-source, strong foundation image recognition models for developers building with AI tools.
A research project from Facebook that explores multimodal AI models for computer vision and language tasks.
Official PyTorch implementation of TokenFlow, a novel diffusion-based method for consistent video editing presented at ICLR 2024.
SALMONN is a suite of advanced multi-modal large language models (LLMs) for audio, speech, and video understanding.
Get weekly updates on trending AI coding tools and projects.