A minimal reproduction of DeepSeek R1-Zero, a language model trained with reinforcement learning.
A Python-based library for solving visual understanding tasks using vision-language models (VLMs) trained with reinforcement learning.
A critical perspective on R1-Zero-like training, a technique for large language models.