NIPS Proceedings β

Hado P. van Hasselt

1 Paper

  • Weighted importance sampling for off-policy learning with linear function approximation (2014)