Showing 1-3 of 3 projects
An open-source, large language model-based multimodal dialogue system that achieves near-GPT-4o performance.
Chinese version of CLIP for cross-modal retrieval and representation generation
PyTorch code for Bootstrapping Language-Image Pre-training for Unified Vision-Language Understanding and Generation
Get weekly updates on trending AI coding tools and projects.