Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL · #OFFLINERL2020
The Importance of Pessimism in Fixed-Dataset Policy Optimization
- Jacob Buckman, Carles Gelada, Marc G. Bellemare
- PDF