Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling

Gregory W. Benton; Wesley J. Maddox; Sanae Lotfi; Andrew Gordon Wilson

arXiv:2102.13042 · cs.LG · November 17, 2021 · 5 cites

TL;DR

This paper shows that many independently trained models can be connected by simplicial complexes that form multi-dimensional manifolds of low loss, and uses this to build simplexes for fast ensembling with better accuracy, calibration, and robustness to dataset shift than independently trained deep ensembles.

Contribution

It presents an efficient procedure for building mode-connecting simplicial complexes for fast ensembling, which outperforms independently trained deep ensembles.

Findings

01

Constructed mode-connecting simplicial complexes for multiple models.

02

Achieved fast ensembling that outperforms independently trained deep ensembles in accuracy, calibration, and robustness to dataset shift.

03

Requires only a few training epochs to find a low-loss simplex when starting from a pre-trained solution.

Abstract

With a better understanding of the loss surfaces for multilayer networks, we can build more robust and accurate training procedures. Recently it was discovered that independently trained SGD solutions can be connected along one-dimensional paths of near-constant training loss. In this paper, we show that there are mode-connecting simplicial complexes that form multi-dimensional manifolds of low loss, connecting many independently trained models. Inspired by this discovery, we show how to efficiently build simplicial complexes for fast ensembling, outperforming independently trained deep ensembles in accuracy, calibration, and robustness to dataset shift. Notably, our approach only requires a few training epochs to discover a low-loss simplex, starting from a pre-trained solution. Code is available at https://github.com/g-benton/loss-surface-simplexes.
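To make the idea of ensembling from a simplex concrete, here is a minimal sketch, not the authors' implementation (the official repository is PyTorch-based). It assumes the simplex vertices are already available as flat parameter vectors and that an ensemble is formed by sampling convex combinations of those vertices and averaging the predictions, which is one plausible reading of the abstract. The toy two-class linear model and all function names (`sample_simplex_weights`, `combine_vertices`, `simplex_ensemble`) are hypothetical and chosen only for illustration.

```python
import math
import random


def sample_simplex_weights(num_vertices, rng):
    """Draw barycentric coordinates uniformly from the standard simplex.

    Normalizing i.i.d. Exponential(1) draws gives a Dirichlet(1, ..., 1) sample.
    """
    draws = [rng.expovariate(1.0) for _ in range(num_vertices)]
    total = sum(draws)
    return [d / total for d in draws]


def combine_vertices(vertices, coeffs):
    """Return the convex combination of the vertex parameter vectors."""
    dim = len(vertices[0])
    return [sum(c * v[i] for c, v in zip(coeffs, vertices)) for i in range(dim)]


def softmax(logits):
    """Numerically stable softmax over a list of logits."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    s = sum(exps)
    return [e / s for e in exps]


def predict(params, x):
    """Toy 2-class linear model (an assumption): params = [w0, w1, b]."""
    score = params[0] * x[0] + params[1] * x[1] + params[2]
    return softmax([0.0, score])


def simplex_ensemble(vertices, x, num_samples=10, seed=0):
    """Average class probabilities over models sampled from the simplex."""
    rng = random.Random(seed)
    probs = [0.0, 0.0]
    for _ in range(num_samples):
        coeffs = sample_simplex_weights(len(vertices), rng)
        params = combine_vertices(vertices, coeffs)
        p = predict(params, x)
        probs = [acc + pi / num_samples for acc, pi in zip(probs, p)]
    return probs


# Three hypothetical vertices of a 2-simplex in parameter space.
vertices = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.5, 0.5, 0.1]]
print(simplex_ensemble(vertices, x=[1.0, 2.0]))
```

In this sketch, every sampled model lies inside the simplex spanned by the vertices, so the ensemble is only useful if the whole simplex is a low-loss region, which is the property the paper sets out to establish.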

Code & Models

Repositories

g-benton/loss-surface-simplexes
PyTorch · Official

Videos

Loss Surface Simplexes for Mode Connecting Volumes and Fast Ensembling · SlidesLive

Taxonomy

Topics: Model Reduction and Neural Networks · Neural Networks and Reservoir Computing · Random lasers and scattering media

Methods: Deep Ensembles · Stochastic Gradient Descent