Authors propose a method for adaptive selection of data points for SGD. Specifically, authors use the ADAM method and extend it to adaptive sampling setting using multi-armed bandit. Proposed method is further analyzed and improvement in the convergence speed is quantified. Extensive empirical results also support the proposed method. All reviewers unanimously recommend accept. Clear accept.