Gradformer: Graph Transformer with Exponential Decay
Chuang Liu, Zelin Yao, Yibing Zhan, Xueqi Ma, Shirui Pan, Wenbin Hu

TL;DR
Gradformer introduces an exponential decay mask in Graph Transformers to better incorporate graph structure biases, improving performance and training stability across multiple graph tasks.
Contribution
The paper proposes Gradformer, which integrates an exponential decay attention mask with learnable constraints, enhancing structural bias modeling and deep model training in Graph Transformers.
Findings
Outperforms baseline models on various benchmarks.
Maintains accuracy in deep models better than existing GTs.
Effective in both classification and regression tasks.
Abstract
Graph Transformers (GTs) have demonstrated their advantages across a wide range of tasks. However, the self-attention mechanism in GTs overlooks the graph's inductive biases, particularly biases related to structure, which are crucial for the graph tasks. Although some methods utilize positional encoding and attention bias to model inductive biases, their effectiveness is still suboptimal analytically. Therefore, this paper presents Gradformer, a method innovatively integrating GT with the intrinsic inductive bias by applying an exponential decay mask to the attention matrix. Specifically, the values in the decay mask matrix diminish exponentially, correlating with the decreasing node proximities within the graph structure. This design enables Gradformer to retain its ability to capture information from distant nodes while focusing on the graph's local details. Furthermore, Gradformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Fluorescence Microscopy Techniques
MethodsGoal-Driven Tree-Structured Neural Model · Exponential Decay · Graph Neural Network
