Scaling Law with Learning Rate Annealing