NIPS Proceedings

Philip S. Thomas

4 Papers

  • Policy Evaluation Using the Ω-Return (2015)
  • Projected Natural Actor-Critic (2013)
  • Policy Gradient Coagent Networks (2011)
  • TD_gamma: Re-evaluating Complex Backups in Temporal Difference Learning (2011)