Adaptive Gradient Quantization for Data-Parallel SGD