Explore Projects

Discover 23 open source projects

Active filters (1):
Search: rlhf

Showing 21-23 of 23 projects

PKU-Alignment/safe-rlhf

A safe reinforcement learning from human feedback (RLHF) system for aligning large language models with human values.

1.6K
Stable
Python
LLM Frameworks
Reinforcement Learning
#ai-safety #large-language-models #reinforcement-learning

OpenLMLab/MOSS-RLHF

A Python library for exploring the secrets of RLHF (reinforcement learning from human feedback) in large language models.

1.4K
Archived
Python
LLM Frameworks
Agents & Orchestration
#ai-safety #alignment #rlhf

xtreme1-io/xtreme1

An all-in-one data labeling and annotation platform for multimodal data training, supporting 3D LiDAR, images, and language models.

1.2K
Experimental
TypeScript
Computer Vision
Inference
#3d-annotation #annotation-tool #lidar-annotation
