Convex Dominance in Deep Learning I: A Scaling Law of Loss and Learning Rate