Label Denoising through Cross-Model Agreement

Yu Wang; Xin Xin; Zaiqiao Meng; Joemon Jose; Fuli Feng

arXiv:2308.13976·cs.LG·December 20, 2023

Label Denoising through Cross-Model Agreement

Yu Wang, Xin Xin, Zaiqiao Meng, Joemon Jose, Fuli Feng

PDF

Open Access

TL;DR

This paper introduces DeCA, a cross-model agreement framework that improves learning from noisy labels by leveraging the similarity in predictions on clean data and differences on noisy data across models.

Contribution

The paper proposes a novel denoising method using cross-model agreement, effective for both binary and multi-label noisy data scenarios.

Findings

01

DeCA significantly outperforms standard training methods.

02

It improves robustness in recommendation and image classification tasks.

03

Cross-model agreement effectively reduces label noise impact.

Abstract

Learning from corrupted labels is very common in real-world machine-learning applications. Memorizing such noisy labels could affect the learning of the model, leading to sub-optimal performances. In this work, we propose a novel framework to learn robust machine-learning models from noisy labels. Through an empirical study, we find that different models make relatively similar predictions on clean examples, while the predictions on noisy examples vary much more across different models. Motivated by this observation, we propose \em denoising with cross-model agreement \em (DeCA) which aims to minimize the KL-divergence between the true label distributions parameterized by two machine learning models while maximizing the likelihood of data observation. We employ the proposed DeCA on both the binary label scenario and the multiple label scenario. For the binary label scenario, we select…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Text and Document Classification Technologies · Anomaly Detection Techniques and Applications