Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL ยท #OFFLINERL2020


Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning

  • Shangtong Zhang, Bo Liu, Shimon Whiteson
  •   PDF