Loading paper
O(1) Communication for Distributed SGD through Two-Level Gradient Averaging | Tomesphere