Showing 1-1 of 1 projects
A modular reinforcement learning library to fine-tune language models to human preferences
Get weekly updates on trending AI coding tools and projects.