Showing 1-9 of 9 projects
A collection of multimodal large language models and their latest advances.
This GitHub repository is a collection of AI-related papers, datasets, and applications focused on prompt engineering and large language models.
A curated list of must-read papers on large language model agents and their applications.
A vision foundation model from BAAI for generalist painting and segmentation tasks.
A PyTorch implementation of VALL-E, a zero-shot text-to-speech model for vibe coders.
An open-source library for building generative multimodal AI models, with a focus on foundation models, in-context learning, and multimodal pretraining.
Comprehensive resources for in-context learning and prompt engineering for large language models like ChatGPT and GPT-3.
UNO is a universal customization method for both single and multi-subject image generation using diffusion models.
A curated collection of papers on generative information extraction using large language models.
Get weekly updates on trending AI coding tools and projects.