NIPS Proceedings

Paul F. Christiano

1 Paper

  • Deep Reinforcement Learning from Human Preferences (2017)