PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning
Meta Review
The reviewers unanimously appreciated the paper. The author response's clarified some of their concerns, in particular about POLITEX. Please incorporate the reviewers' feedback into your revisions.