TL;DR
This paper introduces NCMemo, a framework to quantify label memorization in GNNs, revealing how graph homophily influences memorization and proposing rewiring techniques to mitigate privacy risks.
Contribution
It is the first to analyze label memorization in GNNs, establishing the relationship with graph homophily and proposing methods to reduce memorization and enhance privacy.
Findings
Lower homophily increases GNN memorization.
Graph structure's informativeness affects memorization.
Graph rewiring reduces memorization and privacy risks.
Abstract
Deep neural networks (DNNs) have been shown to memorize their training data, yet similar analyses for graph neural networks (GNNs) remain largely under-explored. We introduce NCMemo (Node Classification Memorization), the first framework to quantify label memorization in semi-supervised node classification. We first establish an inverse relationship between memorization and graph homophily, i.e., the property that connected nodes share similar labels/features. We find that lower homophily significantly increases memorization, indicating that GNNs rely on memorization to learn less homophilic graphs. Secondly, we analyze GNN training dynamics. We find that the increased memorization in low homophily graphs is tightly coupled to the GNNs' implicit bias on using graph structure during learning. In low homophily regimes, this structure is less informative, hence inducing memorization of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
