NIPS Proceedings β

Borja Ibarz

1 Paper

  • Reward learning from human preferences and demonstrations in Atari (2018)