NIPS Proceedingsβ

Tom Stepleton

1 Paper

  • Safe and Efficient Off-Policy Reinforcement Learning (2016)