Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL ยท #OFFLINERL2020
Double Explore-then-Commit: Asymptotic Optimality and Beyond
- Tianyuan Jin, Pan Xu, Xiaokui Xiao, Quanquan Gu
-   PDF   Supplement