A curated collection of YOLO object detection projects and datasets for developers working with computer vision and AI.
Inference tool for Microsoft's Florence2 Vision-Language Model (VLM), built for vibe coders using AI tools.
Official repo for Pai-Megatron-Patch, a large language model and visual language model training framework developed by Alibaba Cloud.
Real-time camera monitoring with VLM using TypeScript and React.
A collection of novel jailbreak methods for large language models (LLMs) focused on privacy and safety.
A curated list of famous vision-language models and their architectures for developers working with AI tools.
An open-source end-to-end VLM-based GUI agent for developers building with AI tools.
Aircraft design optimization made fast through computational graph transformations and composable analysis tools.
A project exploring human-machine collaboration using a robotic arm, large language models, and multimodal AI.
JoyCaption is an open, uncensored image captioning Visual Language Model (VLM) for training diffusion models.
A family of lightweight multimodal models for ChatGPT, GPT-4, and other large language models.