Loading paper
OD-SGD: One-step Delay Stochastic Gradient Descent for Distributed Training | Tomesphere