Explore Projects

Discover 13 open source projects

Active filters (1):
Search: llavaร—
Clear all

Showing 1-13 of 13 projects

ollama/ollama

Run open-source AI models locally with Ollama, supporting multiple frameworks and integrations.

164.2K
Active
Go
AI Model Serving
Go
#ollama#ai-model-serving#local-ai

haotian-liu/LLaVA

LLaVA is a visual instruction tuning framework for large language and vision models, enabling GPT-4 level capabilities.

24.5K
Archived
Python
Computer Vision
LLM Frameworks
PyTorch
#llava#gpt-4#instruction-tuning

modelscope/ms-swift

A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.

12.9K
Active
Python
LLM Frameworks
Python
#llm#multimodal#fine-tuning

Fanghua-Yu/SUPIR

SUPIR is a Python library for developing practical algorithms for photo-realistic image restoration using AI.

5.5K
Experimental
Python
Computer Vision
Inference
PyTorch
#deep-learning#diffusion-models#llava

open-compass/VLMEvalKit

Open-source toolkit for evaluating large multi-modal AI models, supporting 220+ models and 80+ benchmarks.

3.9K
Active
Python
LLM Frameworks
LLM Wrappers & SDKs
PyTorch
#chatgpt#llm#multi-modal

yuanzhoulvpi2017/zero_nlp

A Chinese NLP solution with large models, data, training, and inference capabilities for developers.

3.8K
Stable
Jupyter Notebook
LLM Frameworks
API Frameworks
PyTorch
#bert#chatglm-6b#gpt

PKU-YuanGroup/Video-LLaVA

A large-scale vision-language model for video understanding and generation.

3.5K
Archived
Python
LLM Frameworks
Computer Vision
Python
#large-vision-language-model#video-understanding#multi-modal

QiuYannnn/Local-File-Organizer

An AI-powered file management tool that organizes local files while ensuring privacy.

3.1K
Archived
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#file-organizer#llama3#llm

mbzuai-oryx/Video-ChatGPT

A video conversation model that combines LLM capabilities with pretrained visual encoders for video-based chatbots.

1.5K
Experimental
Python
LLM Frameworks
Computer Vision
PyTorch
#chatbot#video-conversation#vision-language

lxtGH/OMG-Seg

Official codebase for OMG-LLaVA and OMG-Seg, state-of-the-art computer vision models presented at CVPR-24 and NeurIPS-24.

1.3K
Stable
Python
Computer Vision
LLM Frameworks
Python
#computer-vision#llm#inference

jhc13/taggui

A Python-based tool for managing and captioning image datasets, with support for various AI models and frameworks.

1.3K
Stable
Python
Computer Vision
Component Libraries (React)
#image-tagging#image-captioning#llava

unum-cloud/UForm

Multimodal AI toolkit for fast content understanding and generation across text, images, and video

1.2K
Stable
Python
LLM Frameworks
Computer Vision
PyTorch
#multimodal-ai#cross-modal#semantic-search

gokayfem/awesome-vlm-architectures

A curated list of famous vision-language models and their architectures for developers working with AI tools.

1.2K
Active
Markdown
LLM Frameworks
Frontend Frameworks
React
#vision-language-models#multimodal#llm

Stay in the loop

Get weekly updates on trending AI coding tools and projects.