Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL · #OFFLINERL2020
Q-Value Weighted Regression: Reinforcement Learning with Limited Data
- Piotr Kozakowski, Łukasz Kaiser, Henryk Michalewski, Afroz Mohiuddin, Katarzyna Kańska