Would I Lie To You? Inference Time Alignment of Language Models using Direct Preference Heads

Item #: 079017-3022

Details

Description

 

