FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks

Kaituo Feng; Changsheng Li; Ye Yuan; Guoren Wang

arXiv:2206.06561·cs.LG·March 28, 2023

FreeKD: Free-direction Knowledge Distillation for Graph Neural Networks

Kaituo Feng, Changsheng Li, Ye Yuan, Guoren Wang

PDF

TL;DR

FreeKD introduces a reinforcement learning-based framework for graph neural network knowledge distillation that eliminates the need for a well-optimized teacher, enabling two shallow GNNs to exchange knowledge dynamically and effectively.

Contribution

This work is the first to propose a free-direction knowledge distillation framework for GNNs using reinforcement learning, removing the dependency on a deep teacher model.

Findings

01

FreeKD outperforms baseline GNNs on five benchmark datasets.

02

FreeKD achieves comparable or better results than traditional KD methods with deep teachers.

03

The framework is compatible with various GNN architectures.

Abstract

Knowledge distillation (KD) has demonstrated its effectiveness to boost the performance of graph neural networks (GNNs), where its goal is to distill knowledge from a deeper teacher GNN into a shallower student GNN. However, it is actually difficult to train a satisfactory teacher GNN due to the well-known over-parametrized and over-smoothing issues, leading to invalid knowledge transfer in practical applications. In this paper, we propose the first Free-direction Knowledge Distillation framework via Reinforcement learning for GNNs, called FreeKD, which is no longer required to provide a deeper well-optimized teacher GNN. The core idea of our work is to collaboratively build two shallower GNNs in an effort to exchange knowledge between them via reinforcement learning in a hierarchical way. As we observe that one typical GNN model often has better and worse performances at different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsBalanced Selection · Knowledge Distillation