NIPS Proceedingsβ

Jan Leike

1 Paper

  • Deep Reinforcement Learning from Human Preferences (2017)