NIPS Proceedings

Dario Amodei

1 Paper

  • Deep Reinforcement Learning from Human Preferences (2017)