All reviewers support acceptance of this paper, and I would also like to recommend acceptance. All reviewers point that this is an interesting a novel approach to learning who to communicate to in a multi-agent setup, which is both interesting from a research perspective but also useful in practical applications of multi-agent communication. Moreover, this paper is well executed, with clear statements supported by sufficient experiments and baselines. Finally, R1 and R2 have expressed concerns regarding the low performance of IC3Net and TarMAC. Authors have provided an explanation in the author response with some more experiments with regards to team vs individual rewards. I think these points should also be incorporated in the manuscript for completeness.