AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers
Wenhao Zhu, Guojie Song, Liang Wang, Shaoguo Liu

TL;DR
AnchorGT introduces a scalable, flexible attention architecture for Graph Transformers that reduces computational complexity to nearly linear, enhances expressiveness beyond Weisfeiler-Lehman, and improves performance and efficiency across models.
Contribution
The paper proposes AnchorGT, a novel attention mechanism for Graph Transformers that is more scalable, expressive, and versatile, addressing limitations of quadratic complexity in existing models.
Findings
Achieves nearly linear complexity in attention computation.
Outperforms baseline models in accuracy and efficiency.
Proves superior expressiveness compared to Weisfeiler-Lehman test.
Abstract
Graph Transformers (GTs) have significantly advanced the field of graph representation learning by overcoming the limitations of message-passing graph neural networks (GNNs) and demonstrating promising performance and expressive power. However, the quadratic complexity of self-attention mechanism in GTs has limited their scalability, and previous approaches to address this issue often suffer from expressiveness degradation or lack of versatility. To address this issue, we propose AnchorGT, a novel attention architecture for GTs with global receptive field and almost linear complexity, which serves as a flexible building block to improve the scalability of a wide range of GT models. Inspired by anchor-based GNNs, we employ structurally important -dominating node set as anchors and design an attention mechanism that focuses on the relationship between individual nodes and anchors,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Memory and Neural Computing · Advanced Graph Neural Networks · Ferroelectric and Negative Capacitance Devices
MethodsSparse Evolutionary Training · Goal-Driven Tree-Structured Neural Model
