NIPS Proceedings

Randy Jia

1 Paper

  • Optimistic posterior sampling for reinforcement learning: worst-case regret bounds (2017)