NIPS Proceedingsβ

Damien Vincent

1 Paper

  • Adaptive Temporal-Difference Learning for Policy Evaluation with Per-State Uncertainty Estimates (2019)