NeurIPS 2020

PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning


Meta Review

The reviewers unanimously appreciated the paper. The author response's clarified some of their concerns, in particular about POLITEX. Please incorporate the reviewers' feedback into your revisions.