NIPS Proceedings
β
Books
Hado P. van Hasselt
1 Paper
Weighted importance sampling for off-policy learning with linear function approximation
(2014)