NIPS Proceedings

Tobias Pohlen

1 Paper

  • Reward learning from human preferences and demonstrations in Atari (2018)