Parallel SGD: When does averaging help?