Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL · #OFFLINERL2020


Counterfactual Policy Evaluation and the Conditional Monte Carlo Method

  • Michel Ma, Pierre-Luc Bacon