Explore Projects

Discover 5 open source projects

BradyFU/Awesome-Multimodal-Large-Language-Models

A collection of multimodal large language models and their latest advances.

17.4K

Active

React

#large-language-models #multimodal-chain-of-thought #in-context-learning

PKU-YuanGroup/Video-LLaVA

A large-scale vision-language model for video understanding and generation.

3.5K

Archived

Python

LLM Frameworks

Computer Vision

#large-vision-language-model #video-understanding #multi-modal

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K

Experimental

Python

LLM Frameworks

Computer Vision

PyTorch

#chatgpt #gpt-4 #multimodal

zhaochen0110/Awesome_Think_With_Images

A curated list of resources for leveraging visual information in large vision-language models (LVLMs) for complex reasoning, planning, and generation.

1.3K

Stable

LLM Frameworks

Multimodal Reasoning & Visual Reasoning

#large-vision-language-models #multimodal-reasoning #visual-reasoning

ShareGPT4Omni/ShareGPT4Video

An official implementation of a system for improving video understanding and generation with better captions.

1.1K

Archived

Python

LLM Frameworks

Computer Vision

PyTorch

#chatgpt #gpt-4 #computer-vision
