Explore Projects

Discover 31 open-source projects

Active filters (1):
Search: vlm

Showing 21-31 of 31 projects

coderonion/awesome-yolo-object-detection

A curated collection of YOLO object detection projects and datasets for developers working with computer vision and AI.

1.7K
Experimental
Computer Vision
Datasets
#object-detection #yolo #datasets

kijai/ComfyUI-Florence2

Inference tool for Microsoft's Florence2 vision-language model (VLM), built for vibe coders using AI tools.

1.6K
Active
Python
LLM Frameworks
Inference
Python
#llm #language-model #inference

alibaba/Pai-Megatron-Patch

Official repo for Pai-Megatron-Patch, a large language model and visual language model training framework developed by Alibaba Cloud.

1.5K
Stable
Python
LLM Frameworks
ML Ops
Python
#large-language-model #visual-language-model #distributed-training

zapdos-labs/unblink

Real-time camera monitoring with VLM using TypeScript and React.

1.3K
Active
TypeScript
VLM
Next.js
#real-time #camera #VLM

yueliu1999/Awesome-Jailbreak-on-LLMs

A collection of novel jailbreak methods for large language models (LLMs) focused on privacy and safety.

1.2K
Active
LLM Frameworks
Privacy Tools
#llms #privacy #safety

gokayfem/awesome-vlm-architectures

A curated list of famous vision-language models and their architectures for developers working with AI tools.

1.2K
Active
Markdown
LLM Frameworks
Frontend Frameworks
React
#vision-language-models #multimodal #llm

zai-org/CogAgent

An open-source end-to-end VLM-based GUI agent for developers building with AI tools.

1.1K
Experimental
Python
Agents & Orchestration
AI Code Editors
React
#agent #gui #vlm

peterdsharpe/AeroSandbox

Aircraft design optimization made fast through computational graph transformations and composable analysis tools.

1.1K
Stable
Jupyter Notebook
Computer Vision
API Frameworks
#aerodynamics #aerospace #aircraft-design

TommyZihao/vlm_arm

A project exploring human-machine collaboration using a robotic arm, large language models, and multimodal AI.

1.1K
Experimental
Jupyter Notebook
Agents & Orchestration
Robotics
#robotics #multimodal-ai #human-machine-collaboration

fpgaminer/joycaption

JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training diffusion models.

1.1K
Stable
Jupyter Notebook
LLM Frameworks
Computer Vision
Jupyter Notebook
#captioning #joycaption #vlm

BAAI-DCAI/Bunny

A family of lightweight multimodal models for ChatGPT, GPT-4, and other large language models.

1.1K
Archived
Python
LLM Frameworks
LLM Wrappers & SDKs
Python
#chatgpt #gpt-4 #multimodal