DiffusAL: Coupling Active Learning with Graph Diffusion for Label-Efficient Node Classification
Sandra Gilhuber, Julian Busch, Daniel Rotthues, Christian M. M. Frey, and Thomas Seidl

TL;DR
DiffusAL is a novel active graph learning method that combines multiple scoring functions, including graph diffusion heuristics, to efficiently select informative nodes for labeling, significantly outperforming random sampling across diverse datasets.
Contribution
We introduce DiffusAL, a robust, parameter-free active learning approach that integrates model uncertainty, diversity, and graph diffusion heuristics for improved node selection.
Findings
Outperforms random sampling on all benchmark datasets.
Efficient pre-processing enables faster acquisition and training.
Significantly improves label efficiency in node classification.
Abstract
Node classification is one of the core tasks on attributed graphs, but successful graph learning solutions require sufficiently labeled data. To keep annotation costs low, active graph learning focuses on selecting the most qualitative subset of nodes that maximizes label efficiency. However, deciding which heuristic is best suited for an unlabeled graph to increase label efficiency is a persistent challenge. Existing solutions either neglect aligning the learned model and the sampling method or focus only on limited selection aspects. They are thus sometimes worse or only equally good as random sampling. In this work, we introduce a novel active graph learning approach called DiffusAL, showing significant robustness in diverse settings. Toward better transferability between different graph structures, we combine three independent scoring functions to identify the most informative node…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Text and Document Classification Technologies · Data Quality and Management
MethodsDiffusion · Focus
