Model-agnostic information transfer and fusion for classification with label noise
Zhu Guojun, Zhang Sanguo, Ren Mingyang

TL;DR
This paper introduces a model-agnostic, nonparametric framework for classification with label noise, effectively leveraging small clean datasets to improve large noisy data, especially in complex domains like medical imaging.
Contribution
It proposes a novel, generic nonparametric approach that addresses distribution shift issues and is applicable across various classifiers, backed by rigorous theory and empirical validation.
Findings
Framework effectively utilizes small clean data to purify large noisy datasets.
Method demonstrates superior performance in medical image classification tasks.
Theoretical guarantees support the robustness of the approach.
Abstract
Label noise presents a fundamental challenge in modern machine learning, especially when large-scale datasets are generated via automated processes. An increasingly common and important data paradigm, particularly in domains like medical imaging, involves learning from a large dataset with coarse, noisy labels supplemented by a small, expert-verified, clean dataset. This setting constitutes a typical information transfer and fusion problem. However, the significant distribution shift between the noisy and clean data violates the core overall parametric similarity assumptions of existing statistical transfer learning methods, while their reliance on parametric models is ill-suited for complex data like images. To address these limitations, this paper develops a generic model-agnostic nonparametric framework for classification with label noise, which applies to a broad class of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
