Batch Normalization Is Blind to the First and Second Derivatives of the Loss