NeurIPS 2020

Sample Complexity of Asynchronous Q-Learning: Sharper Analysis and Variance Reduction

Meta Review

The reviewers appreciated the efforts made by the authors in the rebuttal and updated their reviews accordingly. The paper’s contributions are now clear and important: an improved sample complexity analysis of asynchronous Q-learning, and a novel variance reduction algorithm together with its analysis. We recommend the paper for acceptance and encourage the authors to account for the reviewers’ comments when preparing the camera-ready version of the paper.