NIPS Proceedings
β
Books
A. Rupam Mahmood
1 Paper
Weighted importance sampling for off-policy learning with linear function approximation
(2014)