Loading paper
Precise gradient descent training dynamics for finite-width multi-layer neural networks | Tomesphere