Exploring Graph-Transformer Out-of-Distribution Generalization Abilities
Itay Niv, Neta Rabin

TL;DR
This paper investigates the out-of-distribution generalization capabilities of graph-transformer models, demonstrating their superior robustness over traditional message-passing neural networks across various benchmarks and proposing a novel analysis method.
Contribution
It systematically evaluates graph-transformers under distribution shifts, compares them with MPNNs, and introduces a model-agnostic clustering analysis approach for better understanding generalization.
Findings
Graph-transformers outperform MPNNs in OOD settings on most benchmarks.
Hybrid GT-MPNN backbones show strong generalization without specialized algorithms.
The proposed clustering analysis provides insights into domain alignment and class separation.
Abstract
Deep learning on graphs has shown remarkable success across numerous applications, including social networks, bio-physics, traffic networks, and recommendation systems. Regardless of their successes, current methods frequently depend on the assumption that training and testing data share the same distribution, a condition rarely met in real-world scenarios. While graph-transformer (GT) backbones have recently outperformed traditional message-passing neural networks (MPNNs) in multiple in-distribution (ID) benchmarks, their effectiveness under distribution shifts remains largely unexplored. In this work, we address the challenge of out-of-distribution (OOD) generalization for graph neural networks, with a special focus on the impact of backbone architecture. We systematically evaluate GT and hybrid backbones in OOD settings and compare them to MPNNs. To do so, we adapt several leading…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Rough Sets and Fuzzy Logic · Advanced Graph Neural Networks
MethodsGoal-Driven Tree-Structured Neural Model · Focus · Message Passing Neural Network · Sparse Evolutionary Training
