Optimizing Diffusion Rate and Label Reliability in a Graph-Based Semi-supervised Classifier
Bruno Klaus de Aquino Afonso, Lilian Berton

TL;DR
This paper improves graph-based semi-supervised classifiers by optimizing diffusion rate and label reliability, reducing overfitting, and enhancing parameter selection through leave-one-out error minimization and spectral methods.
Contribution
It introduces methods to estimate label reliability and diffusion rate, and demonstrates benefits of removing self-influence in the LGC algorithm for better semi-supervised learning.
Findings
Label reliability estimation competes with robust L1-norm methods.
Removing diagonal entries reduces overfitting.
Spectral methods enable efficient diffusion rate optimization.
Abstract
Semi-supervised learning has received attention from researchers, as it allows one to exploit the structure of unlabeled data to achieve competitive classification results with much fewer labels than supervised approaches. The Local and Global Consistency (LGC) algorithm is one of the most well-known graph-based semi-supervised (GSSL) classifiers. Notably, its solution can be written as a linear combination of the known labels. The coefficients of this linear combination depend on a parameter , determining the decay of the reward over time when reaching labeled vertices in a random walk. In this work, we discuss how removing the self-influence of a labeled instance may be beneficial, and how it relates to leave-one-out error. Moreover, we propose to minimize this leave-one-out loss with automatic differentiation. Within this framework, we propose methods to estimate label…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
