Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL ยท #OFFLINERL2020
On the Convergence Rate of Density Ratio Learning Based Off-Policy Policy Gradient Methods
- Jiawei Huang*, Nan Jiang
-   PDF   Supplement