NIPS Proceedingsβ

Miljan Martic

1 Paper

  • Deep Reinforcement Learning from Human Preferences (2017)