Graph Transformers: A Survey
Ahsan Shehzad, Feng Xia, Shagufta Abid, Ciyuan Peng, Shuo Yu, Dongyu Zhang, Karin Verspoor

TL;DR
This survey reviews recent advances in graph transformer models, discussing their design, applications, challenges, and future research directions in graph-structured data analysis.
Contribution
It provides a comprehensive taxonomy and analysis of graph transformer architectures, integrating recent progress and identifying key challenges in the field.
Findings
Graph transformers demonstrate strong performance across various tasks.
Design principles include integrating graph inductive biases and attention mechanisms.
Remaining challenges include scalability, robustness, and interpretability.
Abstract
Graph transformers are a recent advancement in machine learning, offering a new class of neural network models for graph-structured data. The synergy between transformers and graph learning demonstrates strong performance and versatility across various graph-related tasks. This survey provides an in-depth review of recent progress and challenges in graph transformer research. We begin with foundational concepts of graphs and transformers. We then explore design perspectives of graph transformers, focusing on how they integrate graph inductive biases and graph attention mechanisms into the transformer architecture. Furthermore, we propose a taxonomy classifying graph transformers based on depth, scalability, and pre-training strategies, summarizing key principles for effective development of graph transformer models. Beyond technical analysis, we discuss the applications of graph…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsAttention Is All You Need · Residual Connection · Byte Pair Encoding · Layer Normalization · Laplacian EigenMap · Label Smoothing · Linear Layer · Adam · Dropout · Multi-Head Attention
