Label noise detection under the Noise at Random model with ensemble   filters

Kecia G. Moura; Ricardo B. C. Prud\^encio; George D. C. Cavalcanti

arXiv:2112.01617·cs.LG·October 11, 2022

Label noise detection under the Noise at Random model with ensemble filters

Kecia G. Moura, Ricardo B. C. Prud\^encio, George D. C. Cavalcanti

PDF

1 Repo

TL;DR

This paper examines how ensemble classifiers detect label noise under the Noisy at Random model, revealing that class-dependent noise impacts detection performance and that threshold choices significantly influence results.

Contribution

It introduces an analysis of ensemble noise detection under class-dependent noise models and evaluates the effect of class distribution and threshold selection on detection accuracy.

Findings

01

Class-dependent noise affects detection performance.

02

Threshold selection influences noise detection accuracy.

03

Different noise models lead to varying detection results.

Abstract

Label noise detection has been widely studied in Machine Learning because of its importance in improving training data quality. Satisfactory noise detection has been achieved by adopting ensembles of classifiers. In this approach, an instance is assigned as mislabeled if a high proportion of members in the pool misclassifies it. Previous authors have empirically evaluated this approach; nevertheless, they mostly assumed that label noise is generated completely at random in a dataset. This is a strong assumption since other types of label noise are feasible in practice and can influence noise detection results. This work investigates the performance of ensemble noise detection under two different noise models: the Noisy at Random (NAR), in which the probability of label noise depends on the instance class, in comparison to the Noisy Completely at Random model, in which the probability of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kdMoura/label-noise
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.