Sparse Graph Attention Networks
Yang Ye, and Shihao Ji

TL;DR
This paper introduces Sparse Graph Attention Networks (SGATs) that learn to prune irrelevant edges in large graphs using $L_0$-norm regularization, improving performance and interpretability in graph learning tasks.
Contribution
SGATs are the first to effectively learn sparse attention coefficients for edge pruning, enhancing GNN performance on noisy and large graphs.
Findings
SGATs remove 50-80% of edges in large graphs while maintaining accuracy.
SGATs outperform GATs on disassortative graphs by significant margins.
Removed edges are interpretable and can be quantitatively analyzed.
Abstract
Graph Neural Networks (GNNs) have proved to be an effective representation learning framework for graph-structured data, and have achieved state-of-the-art performance on many practical predictive tasks, such as node classification, link prediction and graph classification. Among the variants of GNNs, Graph Attention Networks (GATs) learn to assign dense attention coefficients over all neighbors of a node for feature aggregation, and improve the performance of many graph learning tasks. However, real-world graphs are often very large and noisy, and GATs are prone to overfitting if not regularized properly. Even worse, the local aggregation mechanism of GATs may fail on disassortative graphs, where nodes within local neighborhood provide more noise than useful information for feature aggregation. In this paper, we propose Sparse Graph Attention Networks (SGATs) that learn sparse…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Complex Network Analysis Techniques · Topic Modeling
