DeGLIF for Label Noise Robust Node Classification using GNNs
Pintu Kumar, Nandyala Hemachandra

TL;DR
DeGLIF is a novel denoising method for graph neural networks that leverages leave-one-out influence functions and a new relabelling strategy to improve node classification accuracy on noisy datasets.
Contribution
We introduce DeGLIF, a new influence-based denoising technique for GNNs that does not require noise level estimation and effectively identifies noisy nodes.
Findings
DeGLIF outperforms baseline algorithms in accuracy across multiple datasets.
The proposed variants effectively identify noisy nodes without prior noise information.
Theoretical proof shows noisy points identified can increase model risk.
Abstract
Noisy labelled datasets are generally inexpensive compared to clean labelled datasets, and the same is true for graph data. In this paper, we propose a denoising technique DeGLIF: Denoising Graph Data using Leave-One-Out Influence Function. DeGLIF uses a small set of clean data and the leave-one-out influence function to make label noise robust node-level prediction on graph data. Leave-one-out influence function approximates the change in the model parameters if a training point is removed from the training dataset. Recent advances propose a way to calculate the leave-one-out influence function for Graph Neural Networks (GNNs). We extend that recent work to estimate the change in validation loss, if a training node is removed from the training dataset. We use this estimate and a new theoretically motivated relabelling function to denoise the training dataset. We propose two DeGLIF…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Sensor and Control Systems · Text and Document Classification Technologies
MethodsSparse Evolutionary Training
