NIPS Proceedingsβ

Masatoshi Uehara

1 Paper

  • Intrinsically Efficient, Stable, and Bounded Off-Policy Evaluation for Reinforcement Learning (2019)