Group Robust Preference Optimization in Reward-free RLHF

Item #:
079017-1171

Details

Description

 

Members/Attendees

 

Tab 4