The authors claim that NN architectures have directional inductive bias, which impacts test accuracy and training speed. They show how to compute such directions using spectral decomposition of network derivatives as a proxy for training speed. Consensus among the reviewers that this is an interesting direction to analyze inductive biases that are implicit in the network architecture.