Gaussian Process Limit Reveals Structural Benefits of Graph Transformers
Nil Ayday, Lingchu Yang, Debarghya Ghoshdastidar

TL;DR
This paper provides a theoretical analysis of graph transformers, showing they have structural advantages over traditional graph convolutional networks, especially in preserving community information and preventing oversmoothing, supported by empirical validation.
Contribution
It introduces a Gaussian process framework for analyzing graph transformers, revealing their ability to maintain discriminative features and community structure in deep layers, which is a novel theoretical insight.
Findings
Graph transformers structurally preserve community information.
They maintain discriminative node representations in deep layers.
Empirical results validate theoretical predictions and show performance improvements.
Abstract
Graph transformers are the state-of-the-art for learning from graph-structured data and are empirically known to avoid several pitfalls of message-passing architectures. However, there is limited theoretical analysis on why these models perform well in practice. In this work, we prove that attention-based architectures have structural benefits over graph convolutional networks in the context of node-level prediction tasks. Specifically, we study the neural network gaussian process limits of graph transformers (GAT, Graphormer, Specformer) with infinite width and infinite heads, and derive the node-level and edge-level kernels across the layers. Our results characterise how the node features and the graph structure propagate through the graph attention layers. As a specific example, we prove that graph transformers structurally preserve community information and maintain discriminative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Healthcare · Functional Brain Connectivity Studies
