NeurIPS 2020

Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs


Meta Review

There is strong agreement among the reviewers that this is very good work.