Understanding Attention and Generalization in Graph Neural Networks

Boris Knyazev; Graham W. Taylor; Mohamed R. Amer

arXiv:1905.02850·cs.LG·October 29, 2019·55 cites

Understanding Attention and Generalization in Graph Neural Networks

Boris Knyazev, Graham W. Taylor, Mohamed R. Amer

PDF

Open Access 2 Repos

TL;DR

This paper investigates the role of attention mechanisms in graph neural networks, revealing conditions under which attention improves performance and proposing a weakly-supervised training method to enhance generalization on various datasets.

Contribution

It provides a controlled study of attention in GNNs, identifies when attention is beneficial, and introduces a weakly-supervised training approach to improve attention effectiveness.

Findings

01

Attention can be harmful or negligible under typical conditions.

02

Properly trained attention can boost classification performance by over 60%.

03

Weakly-supervised training of attention approaches supervised performance and outperforms unsupervised methods.

Abstract

We aim to better understand attention over nodes in graph neural networks (GNNs) and identify factors influencing its effectiveness. We particularly focus on the ability of attention GNNs to generalize to larger, more complex or noisy graphs. Motivated by insights from the work on Graph Isomorphism Networks, we design simple graph reasoning tasks that allow us to study attention in a controlled environment. We find that under typical conditions the effect of attention is negligible or even harmful, but under certain conditions it provides an exceptional gain in performance of more than 60% in some of our classification tasks. Satisfying these conditions in practice is challenging and often requires optimal initialization or supervised training of attention. We propose an alternative recipe and train attention in a weakly-supervised fashion that approaches the performance of supervised…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Topic Modeling · Explainable Artificial Intelligence (XAI)