Loading paper
Averaging Weights Leads to Wider Optima and Better Generalization | Tomesphere