NIPS Proceedings
β
Books
Hamid R. Maei
2 Papers
Convergent Temporal-Difference Learning with Arbitrary Smooth Function Approximation
(2009)
A Convergent $O(n)$ Temporal-difference Algorithm for Off-policy Learning with Linear Function Approximation
(2008)