NIPS Proceedings
β
Books
Paul Wagner
2 Papers
Optimistic policy iteration and natural actor-critic: A unifying view and a non-optimality result
(2013)
A reinterpretation of the policy oscillation phenomenon in approximate policy iteration
(2011)