Reviews: SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits

The paper considers the multiplayer (stochastic) MAB problem in which M players compete over K > M arms, where the reward of an arm is sampled i.i.d. unless there is a collision (two or more players pull the same arm) in which case the reward is zero. The authors give new algorithms with regret bounds comparable to the best existing bounds for centralized algorithms. The reviewers unanimously agreed (and I concur) that the results are significant and the paper is well-written. A clear accept.

Paper ID:	6497
Title:	SIC-MMAB: Synchronisation Involves Communication in Multiplayer Multi-Armed Bandits