Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL ยท #OFFLINERL2020


The Importance of Pessimism in Fixed-Dataset Policy Optimization

  • Jacob Buckman, Carles Gelada, Marc G. Bellemare
  •   PDF