TINED: GNNs-to-MLPs by Teacher Injection and Dirichlet Energy Distillation
Ziang Zhou, Zhihao Ding, Jieming Shi, Qing Li, Shiqi Shen

TL;DR
TINED introduces a novel GNN-to-MLP distillation method that leverages layer-wise transfer, teacher injection, and Dirichlet energy to produce faster, scalable models with preserved layer insights, outperforming existing methods.
Contribution
The paper proposes TINED, a new GNN-to-MLP distillation approach using layer-by-layer transfer, Dirichlet energy distillation, and theoretical bounds, enhancing scalability and performance.
Findings
TINED outperforms GNNs and existing distillation methods on multiple datasets.
Layer-wise transfer preserves GNN layer insights in MLPs.
Dirichlet energy effectively measures and conveys layer smoothing effects.
Abstract
Graph Neural Networks (GNNs) are pivotal in graph-based learning, particularly excelling in node classification. However, their scalability is hindered by the need for multi-hop data during inference, limiting their application in latency-sensitive scenarios. Recent efforts to distill GNNs into multi-layer perceptrons (MLPs) for faster inference often underutilize the layer-level insights of GNNs. In this paper, we present TINED, a novel approach that distills GNNs to MLPs on a layer-by-layer basis using Teacher Injection and Dirichlet Energy Distillation techniques. We focus on two key operations in GNN layers: feature transformation (FT) and graph propagation (GP). We recognize that FT is computationally equivalent to a fully-connected (FC) layer in MLPs. Thus, we propose directly transferring teacher parameters from an FT in a GNN to an FC layer in the student MLP, enhanced by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Handwritten Text Recognition Techniques · Speech Recognition and Synthesis
MethodsFocus · Greedy Policy Search
