Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL · #OFFLINERL2020

On the Convergence Rate of Density Ratio Learning Based Off-Policy Policy Gradient Methods

Jiawei Huang*, Nan Jiang
PDF Supplement