Loading paper
Parle: parallelizing stochastic gradient descent | Tomesphere