Loading paper
Scaling Laws of SignSGD in Linear Regression: When Does It Outperform SGD? | Tomesphere