Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL · #OFFLINERL2020
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning
- Shangtong Zhang, Bo Liu, Shimon Whiteson