Regret-Optimal Model-Free Reinforcement Learning for Discounted MDPs with Short Burn-In Time

Item #: 075280-3537
