Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL ยท #OFFLINERL2020
Near-Optimal Provable Uniform Convergence in Offline Policy Evaluation for Reinforcement Learning
- Ming Yin, Yu Bai, and Yu-Xiang Wang
-   PDF   Supplement