Scaling Collapse Reveals Universal Dynamics in Compute-Optimally Trained Neural Networks