NIPS Proceedingsβ

Shane Legg

3 Papers

  • Reward learning from human preferences and demonstrations in Atari (2018)
  • Deep Reinforcement Learning from Human Preferences (2017)
  • Temporal Difference Updating without a Learning Rate (2007)