Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL · #OFFLINERL2020
On Sampling Error in Batch Action-Value Prediction Algorithms
- Brahma S. Pavse, Josiah P. Hanna, Ishan Durugkar, Peter Stone
- PDF