TL;DR
ROD introduces a reception-aware online distillation method that enhances learning on sparse graphs by leveraging multi-scale reception knowledge and peer-teaching, significantly improving performance across multiple graph tasks.
Contribution
The paper presents a novel reception-aware online knowledge distillation framework specifically designed for sparse graph learning, incorporating multi-scale reception signals and dynamic teacher models.
Findings
Achieves state-of-the-art results on 9 datasets.
Improves robustness in sparse graph scenarios.
Enhances performance across node classification, link prediction, and clustering.
Abstract
Graph neural networks (GNNs) have been widely used in many graph-based tasks such as node classification, link prediction, and node clustering. However, GNNs gain their performance benefits mainly from performing the feature propagation and smoothing across the edges of the graph, thus requiring sufficient connectivity and label information for effective propagation. Unfortunately, many real-world networks are sparse in terms of both edges and labels, leading to sub-optimal performance of GNNs. Recent interest in this sparse problem has focused on the self-training approach, which expands supervised signals with pseudo labels. Nevertheless, the self-training approach inherently cannot realize the full potential of refining the learning performance on sparse graphs due to the unsatisfactory quality and quantity of pseudo labels. In this paper, we propose ROD, a novel reception-aware…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsKnowledge Distillation
