Showing 21-23 of 23 projects
A safe reinforcement learning from human feedback (RLHF) system for aligning large language models with human values.
A Python library for exploring secrets of RLHF (Reward-Weighted Maximum Likelihood Estimation) in large language models
An all-in-one data labeling and annotation platform for multimodal data training, supporting 3D LiDAR, images, and language models.
Get weekly updates on trending AI coding tools and projects.