Informative Pseudo-Labeling for Graph Neural Networks with Few Labels
Yayong Li, Jie Yin, Ling Chen

TL;DR
This paper introduces InfoGNN, a novel pseudo-labeling framework for graph neural networks that selects the most informative nodes based on mutual information maximization, significantly improving learning with very few labels.
Contribution
The paper proposes a new informative pseudo-labeling method, InfoGNN, which enhances GNN training with limited labels by selecting nodes that maximize local neighborhood representation.
Findings
Outperforms state-of-the-art baselines on six datasets
Effectively mitigates label noise and class imbalance
Significantly improves semi-supervised node classification accuracy
Abstract
Graph Neural Networks (GNNs) have achieved state-of-the-art results for semi-supervised node classification on graphs. Nevertheless, the challenge of how to effectively learn GNNs with very few labels is still under-explored. As one of the prevalent semi-supervised methods, pseudo-labeling has been proposed to explicitly address the label scarcity problem. It aims to augment the training set with pseudo-labeled unlabeled nodes with high confidence so as to re-train a supervised model in a self-training cycle. However, the existing pseudo-labeling approaches often suffer from two major drawbacks. First, they tend to conservatively expand the label set by selecting only high-confidence unlabeled nodes without assessing their informativeness. Unfortunately, those high-confidence nodes often convey overlapping information with given labels, leading to minor improvements for model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Graph Theory and Algorithms · Online Learning and Analytics
