Model-agnostic information transfer and fusion for classification with label noise

Zhu Guojun; Zhang Sanguo; Ren Mingyang

arXiv:2604.25845·stat.ME·April 29, 2026

Model-agnostic information transfer and fusion for classification with label noise

Zhu Guojun, Zhang Sanguo, Ren Mingyang

PDF

TL;DR

This paper introduces a model-agnostic, nonparametric framework for classification with label noise, effectively leveraging small clean datasets to improve large noisy data, especially in complex domains like medical imaging.

Contribution

It proposes a novel, generic nonparametric approach that addresses distribution shift issues and is applicable across various classifiers, backed by rigorous theory and empirical validation.

Findings

01

Framework effectively utilizes small clean data to purify large noisy datasets.

02

Method demonstrates superior performance in medical image classification tasks.

03

Theoretical guarantees support the robustness of the approach.

Abstract

Label noise presents a fundamental challenge in modern machine learning, especially when large-scale datasets are generated via automated processes. An increasingly common and important data paradigm, particularly in domains like medical imaging, involves learning from a large dataset with coarse, noisy labels supplemented by a small, expert-verified, clean dataset. This setting constitutes a typical information transfer and fusion problem. However, the significant distribution shift between the noisy and clean data violates the core overall parametric similarity assumptions of existing statistical transfer learning methods, while their reliance on parametric models is ill-suited for complex data like images. To address these limitations, this paper develops a generic model-agnostic nonparametric framework for classification with label noise, which applies to a broad class of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.