Improving Graph Attention Networks with Large Margin-based Constraints
Guangtao Wang, Rex Ying, Jing Huang, Jure Leskovec

TL;DR
This paper introduces a margin-based constraint framework for Graph Attention Networks to mitigate over-fitting and over-smoothing, resulting in improved performance on benchmark datasets.
Contribution
The paper proposes a novel margin-based constraint approach for GATs that addresses over-smoothing and over-fitting issues, enhancing their effectiveness.
Findings
Significant performance improvements over state-of-the-art GATs on benchmarks.
Theoretical demonstration of over-smoothing in GATs.
Effective constraints on attention weights and graph structure improve generalization.
Abstract
Graph Attention Networks (GATs) are the state-of-the-art neural architecture for representation learning with graphs. GATs learn attention functions that assign weights to nodes so that different nodes have different influences in the feature aggregation steps. In practice, however, induced attention functions are prone to over-fitting due to the increasing number of parameters and the lack of direct supervision on attention weights. GATs also suffer from over-smoothing at the decision boundary of nodes. Here we propose a framework to address their weaknesses via margin-based constraints on attention during training. We first theoretically demonstrate the over-smoothing behavior of GATs and then develop an approach using constraint on the attention weights according to the class boundary and feature aggregation pattern. Furthermore, to alleviate the over-fitting problem, we propose…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Topic Modeling · Graph Theory and Algorithms
