Becoming More Robust to Label Noise with Classifier Diversity

Michael R. Smith; Tony Martinez

arXiv:1403.1893·stat.ML·March 11, 2014·2 cites

Becoming More Robust to Label Noise with Classifier Diversity

Michael R. Smith, Tony Martinez

PDF

Open Access

TL;DR

This paper introduces NICD, a novel noise detection method that uses classifier diversity to improve robustness against label noise across various datasets and algorithms.

Contribution

NICD leverages classifier diversity to create a less biased noise measurement, enhancing noise handling in machine learning models.

Findings

01

NICD significantly outperforms other noise handling techniques.

02

NICD is effective across multiple datasets and learning algorithms.

03

Classifier diversity improves noise detection accuracy.

Abstract

It is widely known in the machine learning community that class noise can be (and often is) detrimental to inducing a model of the data. Many current approaches use a single, often biased, measurement to determine if an instance is noisy. A biased measure may work well on certain data sets, but it can also be less effective on a broader set of data sets. In this paper, we present noise identification using classifier diversity (NICD) -- a method for deriving a less biased noise measurement and integrating it into the learning process. To lessen the bias of the noise measure, NICD selects a diverse set of classifiers (based on their predictions of novel instances) to determine which instances are noisy. We examine NICD as a technique for filtering, instance weighting, and selecting the base classifiers of a voting ensemble. We compare NICD with several other noise handling techniques…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Neural Networks and Applications · Data Stream Mining Techniques