NIPS Proceedingsβ

Hado P. van Hasselt

2 Papers

  • Learning values across many orders of magnitude (2016)
  • Weighted importance sampling for off-policy learning with linear function approximation (2014)