NIPS Proceedings
β
Books
Borja Ibarz
1 Paper
Reward learning from human preferences and demonstrations in Atari
(2018)