Gradient descent with identity initialization efficiently learns positive definite linear transformations by deep residual networks