Automatic Graph Topology-Aware Transformer
Chao Wang, Jiaxuan Zhao, Lingling Li, Licheng Jiao, Fang, Liu, Shuyuan Yang

TL;DR
This paper introduces EGTAS, an automated framework for designing effective graph Transformer architectures through evolutionary search, reducing manual effort and achieving competitive performance across various graph tasks.
Contribution
It presents a comprehensive search space and a surrogate performance prediction model for automated graph Transformer architecture search.
Findings
EGTAS constructs high-performance graph Transformers.
Automated search rivals manual and baseline methods.
Effective across diverse graph datasets and tasks.
Abstract
Existing efforts are dedicated to designing many topologies and graph-aware strategies for the graph Transformer, which greatly improve the model's representation capabilities. However, manually determining the suitable Transformer architecture for a specific graph dataset or task requires extensive expert knowledge and laborious trials. This paper proposes an evolutionary graph Transformer architecture search framework (EGTAS) to automate the construction of strong graph Transformers. We build a comprehensive graph Transformer search space with the micro-level and macro-level designs. EGTAS evolves graph Transformer topologies at the macro level and graph-aware strategies at the micro level. Furthermore, a surrogate model based on generic architectural coding is proposed to directly predict the performance of graph Transformers, substantially reducing the evaluation cost of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGraph Theory and Algorithms · Advanced Graph Neural Networks
MethodsAttention Is All You Need · Laplacian EigenMap · Laplacian Positional Encodings · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Position-Wise Feed-Forward Layer · Multi-Head Attention
