NeurIPS 2020
Bias no more: high-probability data-dependent regret bounds for adversarial bandits and MDPs
Meta Review
There is a strong agreement among reviewers that this is a very good work.