NIPS Proceedingsβ

Andras Antos

1 Paper

  • Online Markov Decision Processes under Bandit Feedback (2010)