Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL ยท #OFFLINERL2020


On Sampling Error in Batch Action-Value Prediction Algorithms

  • Brahma S. Pavse, Josiah P. Hanna, Ishan Durugkar, Peter Stone
  •   PDF