DAM-GT: Dual Positional Encoding-Based Attention Masking Graph Transformer for Node Classification

Chenyang Li; Jinsong Chen; John E. Hopcroft; Kun He

arXiv:2505.17660·cs.LG·May 26, 2025

DAM-GT: Dual Positional Encoding-Based Attention Masking Graph Transformer for Node Classification

Chenyang Li, Jinsong Chen, John E. Hopcroft, Kun He

PDF

TL;DR

DAM-GT introduces a dual positional encoding and attention masking mechanism to improve neighborhood token representation and focus in graph Transformers, significantly enhancing node classification accuracy.

Contribution

The paper proposes DAM-GT, a novel graph Transformer model with attribute-aware dual positional encoding and an attention masking strategy to better capture node relationships.

Findings

01

DAM-GT outperforms state-of-the-art methods on various graph datasets.

02

The dual encoding scheme effectively preserves attribute and topological correlations.

03

The attention masking reduces diversion, improving information interaction.

Abstract

Neighborhood-aware tokenized graph Transformers have recently shown great potential for node classification tasks. Despite their effectiveness, our in-depth analysis of neighborhood tokens reveals two critical limitations in the existing paradigm. First, current neighborhood token generation methods fail to adequately capture attribute correlations within a neighborhood. Second, the conventional self-attention mechanism suffers from attention diversion when processing neighborhood tokens, where high-hop neighborhoods receive disproportionate focus, severely disrupting information interactions between the target node and its neighborhood tokens. To address these challenges, we propose DAM-GT, Dual positional encoding-based Attention Masking graph Transformer. DAM-GT introduces a novel dual positional encoding scheme that incorporates attribute-aware encoding via an attribute clustering…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.