Unveiling the Hessian's Connection to the Decision Boundary

Mahalakshmi Sabanayagam; Freya Behrens; Urte Adomaityte; Anna Dawid

arXiv:2306.07104·cs.LG·June 13, 2023·1 cites

Unveiling the Hessian's Connection to the Decision Boundary

Mahalakshmi Sabanayagam, Freya Behrens, Urte Adomaityte, Anna Dawid

PDF

Open Access 1 Repo

TL;DR

This paper reveals that the Hessian's top eigenvectors are linked to the decision boundary complexity in neural networks, enabling new ways to measure and identify well-generalizing minima with wide-margin boundaries.

Contribution

It establishes a novel connection between the Hessian spectrum and decision boundary complexity, introducing new measures and techniques for analyzing neural network minima.

Findings

01

Hessian top eigenvectors characterize decision boundary properties

02

Number of Hessian outliers correlates with boundary complexity

03

Proposed methods accurately identify minima with wide-margin boundaries

Abstract

Understanding the properties of well-generalizing minima is at the heart of deep learning research. On the one hand, the generalization of neural networks has been connected to the decision boundary complexity, which is hard to study in the high-dimensional input space. Conversely, the flatness of a minimum has become a controversial proxy for generalization. In this work, we provide the missing link between the two approaches and show that the Hessian top eigenvectors characterize the decision boundary learned by the neural network. Notably, the number of outliers in the Hessian spectrum is proportional to the complexity of the decision boundary. Based on this finding, we provide a new and straightforward approach to studying the complexity of a high-dimensional decision boundary; show that this connection naturally inspires a new generalization measure; and finally, we develop a novel…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shmoo137/hessian-and-decision-boundary
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopological and Geometric Data Analysis · Face and Expression Recognition · Adversarial Robustness in Machine Learning