Towards Deep Attention in Graph Neural Networks: Problems and Remedies
Soo Yong Lee, Fanchen Bu, Jaemin Yoo, Kijung Shin

TL;DR
This paper investigates the challenges of deep graph attention in GNNs, identifies key problems like over-smoothing, and proposes AERO-GNN, a new architecture that effectively mitigates these issues and improves performance on node classification tasks.
Contribution
The paper introduces AERO-GNN, a novel deep graph attention architecture that addresses over-smoothing and attention smoothness issues, with theoretical guarantees and empirical validation.
Findings
AERO-GNN mitigates over-smoothing in deep layers.
It achieves higher accuracy on multiple node classification benchmarks.
Deep attention in GNNs can be effectively stabilized with the proposed methods.
Abstract
Graph neural networks (GNNs) learn the representation of graph-structured data, and their expressiveness can be further enhanced by inferring node relations for propagation. Attention-based GNNs infer neighbor importance to manipulate the weight of its propagation. Despite their popularity, the discussion on deep graph attention and its unique challenges has been limited. In this work, we investigate some problematic phenomena related to deep graph attention, including vulnerability to over-smoothed features and smooth cumulative attention. Through theoretical and empirical analyses, we show that various attention-based GNNs suffer from these problems. Motivated by our findings, we propose AEROGNN, a novel GNN architecture designed for deep graph attention. AERO-GNN provably mitigates the proposed problems of deep graph attention, which is further empirically demonstrated with (a) its…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Graph Neural Networks · Advanced Memory and Neural Computing · Ferroelectric and Negative Capacitance Devices
