Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation
Using Fast Weights to Attend to the Recent Past
Learning Wake-Sleep Recurrent Attention Models
Do Deep Nets Really Need to be Deep?
Adaptive dropout for training deep neural networks