Showing 1-5 of 5 projects
Multimodal large language model series for developers
A Python library for using and fine-tuning over 900 large language models and multimodal models for various AI tasks.
A collection of tutorials and notebooks on state-of-the-art computer vision models and techniques for developers.
An open-source implementation for fine-tuning Qwen-VL series, a multimodal vision-language model by Alibaba Cloud.
Control Android phones programmatically using Qwen3-VL vision model for UI automation and device interaction.
Get weekly updates on trending AI coding tools and projects.