Loading paper
Beyond the Mean: Fisher-Orthogonal Projection for Natural Gradient Descent in Large Batch Training | Tomesphere