What can linear interpolation of neural network loss landscapes tell us?

Tiffany Vlaar; Jonathan Frankle

arXiv:2106.16004·cs.LG·February 4, 2022

What can linear interpolation of neural network loss landscapes tell us?

Tiffany Vlaar, Jonathan Frankle

PDF

TL;DR

This paper critically examines the use of linear interpolation in neural network loss landscapes, revealing that such visualizations may not reliably indicate optimization difficulty or model performance.

Contribution

The study systematically tests how linear interpolation results vary with different data, initializations, and architectures, challenging previous assumptions about their interpretability.

Findings

01

Linear interpolation shape does not correlate with test accuracy.

02

Certain layers are more sensitive to initialization.

03

Barriers in loss landscapes may not indicate optimization success.

Abstract

Studying neural network loss landscapes provides insights into the nature of the underlying optimization problems. Unfortunately, loss landscapes are notoriously difficult to visualize in a human-comprehensible fashion. One common way to address this problem is to plot linear slices of the landscape, for example from the initial state of the network to the final state after optimization. On the basis of this analysis, prior work has drawn broader conclusions about the difficulty of the optimization problem. In this paper, we put inferences of this kind to the test, systematically evaluating how linear interpolation and final performance vary when altering the data, choice of initialization, and other optimizer and architecture design choices. Further, we use linear interpolation to study the role played by individual layers and substructures of the network. We find that certain layers…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning