Online Cross-Layer Knowledge Distillation on Graph Neural Networks with Deep Supervision
Jiongyu Guo, Defang Chen, Can Wang

TL;DR
This paper introduces Alignahead++, an online cross-layer knowledge distillation method for GNNs that employs deep supervision and collaborative training to improve student model performance without pre-trained teachers.
Contribution
Proposes a novel online distillation framework with deep supervision for GNNs, enabling effective training without pre-trained teacher models and enhancing performance through multi-student collaboration.
Findings
Consistently improves student GNN performance across datasets
Effectiveness increases with more student models
Avoids over-smoothing with deep supervision
Abstract
Graph neural networks (GNNs) have become one of the most popular research topics in both academia and industry communities for their strong ability in handling irregular graph data. However, large-scale datasets are posing great challenges for deploying GNNs in edge devices with limited resources and model compression techniques have drawn considerable research attention. Existing model compression techniques such as knowledge distillation (KD) mainly focus on convolutional neural networks (CNNs). Only limited attempts have been made recently for distilling knowledge from GNNs in an offline manner. As the performance of the teacher model does not necessarily improve as the number of layers increases in GNNs, selecting an appropriate teacher model will require substantial efforts. To address these challenges, we propose a novel online knowledge distillation framework called Alignahead++…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Brain Tumor Detection and Classification · Artificial Intelligence in Healthcare
MethodsKnowledge Distillation · Auxiliary Classifier
