Gradient descent aligns the layers of deep linear networks