AnchorGT: Efficient and Flexible Attention Architecture for Scalable   Graph Transformers

Wenhao Zhu; Guojie Song; Liang Wang; Shaoguo Liu

arXiv:2405.03481·cs.LG·May 7, 2024

AnchorGT: Efficient and Flexible Attention Architecture for Scalable Graph Transformers

Wenhao Zhu, Guojie Song, Liang Wang, Shaoguo Liu

PDF

Open Access

TL;DR

AnchorGT introduces a scalable, flexible attention architecture for Graph Transformers that reduces computational complexity to nearly linear, enhances expressiveness beyond Weisfeiler-Lehman, and improves performance and efficiency across models.

Contribution

The paper proposes AnchorGT, a novel attention mechanism for Graph Transformers that is more scalable, expressive, and versatile, addressing limitations of quadratic complexity in existing models.

Findings

01

Achieves nearly linear complexity in attention computation.

02

Outperforms baseline models in accuracy and efficiency.

03

Proves superior expressiveness compared to Weisfeiler-Lehman test.

Abstract

Graph Transformers (GTs) have significantly advanced the field of graph representation learning by overcoming the limitations of message-passing graph neural networks (GNNs) and demonstrating promising performance and expressive power. However, the quadratic complexity of self-attention mechanism in GTs has limited their scalability, and previous approaches to address this issue often suffer from expressiveness degradation or lack of versatility. To address this issue, we propose AnchorGT, a novel attention architecture for GTs with global receptive field and almost linear complexity, which serves as a flexible building block to improve the scalability of a wide range of GT models. Inspired by anchor-based GNNs, we employ structurally important $k$ -dominating node set as anchors and design an attention mechanism that focuses on the relationship between individual nodes and anchors,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Memory and Neural Computing · Advanced Graph Neural Networks · Ferroelectric and Negative Capacitance Devices

MethodsSparse Evolutionary Training · Goal-Driven Tree-Structured Neural Model