The reviewers gave mixed scores and raised several issues concerning the evaluation and the novelty relative to the ICLR 2018 workshop paper. However, I felt that accepting the paper at this time would be valuable to the NeurIPS community, as it combines pre-existing components in a reasonable way, both yielding a good performance improvement and providing useful insights about these models. I would encourage the authors to address the issues raised by the reviewers in the camera-ready version.