Offline Reinforcement Learning Workshop
Neural Information Processing Systems (NeurIPS)
December 12, 2020
@OfflineRL ยท #OFFLINERL2020
Optimal Mixture Weights for Off-Policy Evaluation with Multiple Behavior Policies
- Jinlin Lai, Lixin Zou, Jiaxing Song
-   PDF   Supplement