NIPS Proceedings

Matthew Soh

1 Paper

  • Stabilizing Off-Policy Q-Learning via Bootstrapping Error Reduction (2019)