Compressing Deep Graph Neural Networks via Adversarial Knowledge Distillation
Huarui He, Jie Wang, Zhanqiu Zhang, Feng Wu

TL;DR
This paper introduces GraphAKD, an adversarial knowledge distillation framework for compressing deep GNNs by training a generator and discriminator to effectively transfer knowledge from teacher to student models.
Contribution
It is the first to apply adversarial training to knowledge distillation in graph neural networks, improving the transfer of complex structural knowledge.
Findings
Significant performance improvements on node and graph classification tasks.
Effective compression of deep GNNs with minimal loss of accuracy.
Demonstrated superiority over traditional distillation methods.
Abstract
Deep graph neural networks (GNNs) have been shown to be expressive for modeling graph-structured data. Nevertheless, the over-stacked architecture of deep graph models makes it difficult to deploy and rapidly test on mobile or embedded systems. To compress over-stacked GNNs, knowledge distillation via a teacher-student architecture turns out to be an effective technique, where the key step is to measure the discrepancy between teacher and student networks with predefined distance functions. However, using the same distance for graphs of various structures may be unfit, and the optimal distance formulation is hard to determine. To tackle these problems, we propose a novel Adversarial Knowledge Distillation framework for graph models named GraphAKD, which adversarially trains a discriminator and a generator to adaptively detect and decrease the discrepancy. Specifically, noticing that the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Adversarial Robustness in Machine Learning
MethodsKnowledge Distillation
