Neural gradients are near-lognormal: improved quantized and sparse training