Offline Reinforcement Learning Workshop

Neural Information Processing Systems (NeurIPS)

December 12, 2020

@OfflineRL ยท #OFFLINERL2020


Gradient Analysis and Approximations for Off-policy Optimization

  • Ramki Gummadi, Dale Schuurmans
  •   PDF