Loading paper
Neural Scaling Laws Rooted in the Data Distribution | Tomesphere