NIPS Proceedings
Philip S. Thomas
4 Papers
Policy Evaluation Using the Ω-Return (2015)
Projected Natural Actor-Critic (2013)
Policy Gradient Coagent Networks (2011)
TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning (2011)