Unveiling the Hessian's Connection to the Decision Boundary
Mahalakshmi Sabanayagam, Freya Behrens, Urte Adomaityte, Anna Dawid

TL;DR
This paper reveals that the Hessian's top eigenvectors are linked to the decision boundary complexity in neural networks, enabling new ways to measure and identify well-generalizing minima with wide-margin boundaries.
Contribution
It establishes a novel connection between the Hessian spectrum and decision boundary complexity, introducing new measures and techniques for analyzing neural network minima.
Findings
Hessian top eigenvectors characterize decision boundary properties
Number of Hessian outliers correlates with boundary complexity
Proposed methods accurately identify minima with wide-margin boundaries
Abstract
Understanding the properties of well-generalizing minima is at the heart of deep learning research. On the one hand, the generalization of neural networks has been connected to the decision boundary complexity, which is hard to study in the high-dimensional input space. Conversely, the flatness of a minimum has become a controversial proxy for generalization. In this work, we provide the missing link between the two approaches and show that the Hessian top eigenvectors characterize the decision boundary learned by the neural network. Notably, the number of outliers in the Hessian spectrum is proportional to the complexity of the decision boundary. Based on this finding, we provide a new and straightforward approach to studying the complexity of a high-dimensional decision boundary; show that this connection naturally inspires a new generalization measure; and finally, we develop a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Face and Expression Recognition · Adversarial Robustness in Machine Learning
