Rethinking Graph Generalization through the Lens of Sharpness-Aware Minimization
Yang Qiu, Yixiong Zou, Jun Wang

TL;DR
This paper investigates the phenomenon of minimal shift flip in graph neural networks, linking it to loss landscape sharpness via Sharpness-Aware Minimization, and proposes an energy-based augmentation method to improve out-of-distribution generalization.
Contribution
It introduces the Local Robust Radius concept, connects loss sharpness to generalization error, and develops E2A, a novel energy-driven augmentation framework for better graph OOD generalization.
Findings
E2A outperforms state-of-the-art methods on multiple benchmarks.
Robust radius decreases during training, indicating increased sharpness.
Energy-based augmentation effectively generates pseudo-OOD samples.
Abstract
Graph Neural Networks (GNNs) have achieved remarkable success across various graph-based tasks but remain highly sensitive to distribution shifts. In this work, we focus on a prevalent yet under-explored phenomenon in graph generalization, Minimal Shift Flip (MSF),where test samples that slightly deviate from the training distribution are abruptly misclassified. To interpret this phenomenon, we revisit MSF through the lens of Sharpness-Aware Minimization (SAM), which characterizes the local stability and sharpness of the loss landscape while providing a theoretical foundation for modeling generalization error. To quantify loss sharpness, we introduce the concept of Local Robust Radius, measuring the smallest perturbation required to flip a prediction and establishing a theoretical link between local stability and generalization. Building on this perspective, we further observe a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Topic Modeling
