Explore Projects

Discover 5 open source projects

Active filters (1):
Search: large-vision-language-modelร—
Clear all

Showing 1-5 of 5 projects

BradyFU/Awesome-Multimodal-Large-Language-Models

A collection of multimodal large language models and their latest advances.

17.4K
Active
React
#large-language-models#multimodal-chain-of-thought#in-context-learning

PKU-YuanGroup/Video-LLaVA

A large-scale vision-language model for video understanding and generation.

3.5K
Archived
Python
LLM Frameworks
Computer Vision
Python
#large-vision-language-model#video-understanding#multi-modal

InternLM/InternLM-XComposer

A comprehensive multimodal system for long-term streaming video and audio interactions using large language models.

2.9K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#multimodal

zhaochen0110/Awesome_Think_With_Images

A curated list of resources for leveraging visual information in large vision-language models (LVLMs) for complex reasoning, planning, and generation.

1.3K
Stable
LLM Frameworks
Multimodal Reasoning & Visual Reasoning
#large-vision-language-models#multimodal-reasoning#visual-reasoning

ShareGPT4Omni/ShareGPT4Video

An official implementation of a system for improving video understanding and generation with better captions.

1.1K
Archived
Python
LLM Frameworks
Computer Vision
PyTorch
#chatgpt#gpt-4#computer-vision

Stay in the loop

Get weekly updates on trending AI coding tools and projects.