NIPS Proceedings

Dario Amodei

2 Papers

  • Reward learning from human preferences and demonstrations in Atari (2018)
  • Deep Reinforcement Learning from Human Preferences (2017)