Trajectory Data Suffices for Statistically Efficient Learning in Offline RL with Linear $q^\pi$-Realizability and Concentrability

Item #:
079017-2649

Details

Description

 

Members/Attendees

 

Tab 4