Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL ยท #OFFLINERL2020
EMaQ: Expected-Max Q-Learning Operator for Simple Yet Effective Offline and Online RL
- Seyed Kamyar Seyed Ghasemipour, Dale Schuurmans, Shixiang Gu
-   PDF