Loading paper
Data-Parallel Neural Network Training via Nonlinearly Preconditioned Trust-Region Method | Tomesphere