Showing 1-3 of 3 projects
An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.
OpenMMLab's toolbox and benchmark for advanced video understanding and action recognition.
Video classification tools using 3D ResNet for action recognition and computer vision tasks.
Get weekly updates on trending AI coding tools and projects.