What can linear interpolation of neural network loss landscapes tell us?
Tiffany Vlaar, Jonathan Frankle

TL;DR
This paper critically examines the use of linear interpolation in neural network loss landscapes, revealing that such visualizations may not reliably indicate optimization difficulty or model performance.
Contribution
The study systematically tests how linear interpolation results vary with different data, initializations, and architectures, challenging previous assumptions about their interpretability.
Findings
Linear interpolation shape does not correlate with test accuracy.
Certain layers are more sensitive to initialization.
Barriers in loss landscapes may not indicate optimization success.
Abstract
Studying neural network loss landscapes provides insights into the nature of the underlying optimization problems. Unfortunately, loss landscapes are notoriously difficult to visualize in a human-comprehensible fashion. One common way to address this problem is to plot linear slices of the landscape, for example from the initial state of the network to the final state after optimization. On the basis of this analysis, prior work has drawn broader conclusions about the difficulty of the optimization problem. In this paper, we put inferences of this kind to the test, systematically evaluating how linear interpolation and final performance vary when altering the data, choice of initialization, and other optimizer and architecture design choices. Further, we use linear interpolation to study the role played by individual layers and substructures of the network. We find that certain layers…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Advanced Neural Network Applications · Adversarial Robustness in Machine Learning
