NeurIPS 2020

Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization

Meta Review

Even before the author response, the reviewers agreed that the results and approach were interesting. The response addressed the reviewers' remaining concerns about novelty, baseline strength, and positioning with respect to prior work. This led the reviewers to a consensus that the paper should be accepted.