Distilling Influences to Mitigate Prediction Churn in Graph Neural   Networks

Andreas Roth; Thomas Liebig

arXiv:2310.00946·cs.LG·October 3, 2023

Distilling Influences to Mitigate Prediction Churn in Graph Neural Networks

Andreas Roth, Thomas Liebig

PDF

Open Access 1 Repo

TL;DR

This paper investigates prediction churn in graph neural networks, introduces the Influence Difference metric, and proposes DropDistillation to improve model stability and performance in knowledge distillation tasks.

Contribution

It introduces the Influence Difference metric and DropDistillation method to reduce prediction churn and enhance stability in graph neural network knowledge distillation.

Findings

01

DropDistillation outperforms previous methods in stability and performance

02

Influence Difference effectively quantifies feature utilization differences

03

Models with minimized influence difference show improved prediction consistency

Abstract

Models with similar performances exhibit significant disagreement in the predictions of individual samples, referred to as prediction churn. Our work explores this phenomenon in graph neural networks by investigating differences between models differing only in their initializations in their utilized features for predictions. We propose a novel metric called Influence Difference (ID) to quantify the variation in reasons used by nodes across models by comparing their influence distribution. Additionally, we consider the differences between nodes with a stable and an unstable prediction, positing that both equally utilize different reasons and thus provide a meaningful gradient signal to closely match two models even when the predictions for nodes are similar. Based on our analysis, we propose to minimize this ID in Knowledge Distillation, a domain where a new model should closely match…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

roth-andreas/distilling-influences
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Graph Neural Networks · Explainable Artificial Intelligence (XAI) · Machine Learning in Materials Science

MethodsKnowledge Distillation