NIPS Proceedingsβ

Shane Legg

2 Papers

  • Deep Reinforcement Learning from Human Preferences (2017)
  • Temporal Difference Updating without a Learning Rate (2007)