Improving Generalization of Deep Fault Detection Models in the Presence   of Mislabeled Data

Katharina Rombach; Gabriel Michau; Olga Fink

arXiv:2009.14606·cs.LG·November 24, 2021

Improving Generalization of Deep Fault Detection Models in the Presence of Mislabeled Data

Katharina Rombach, Gabriel Michau, Olga Fink

PDF

TL;DR

This paper introduces a two-step framework to improve the generalization of deep fault detection models by effectively handling mislabeled data, which is common in real-world datasets, especially when no noise-free validation data exists.

Contribution

A novel two-step approach that identifies outliers and adjusts training data to enhance robustness against label noise in fault detection models.

Findings

01

Significantly improves model generalization under high label noise.

02

Effective in real-world scenarios without clean validation datasets.

03

Outperforms previous methods in robustness to mislabeled data.

Abstract

Mislabeled samples are ubiquitous in real-world datasets as rule-based or expert labeling is usually based on incorrect assumptions or subject to biased opinions. Neural networks can "memorize" these mislabeled samples and, as a result, exhibit poor generalization. This poses a critical issue in fault detection applications, where not only the training but also the validation datasets are prone to contain mislabeled samples. In this work, we propose a novel two-step framework for robust training with label noise. In the first step, we identify outliers (including the mislabeled samples) based on the update in the hypothesis space. In the second step, we propose different approaches to modifying the training data based on the identified outliers and a data augmentation technique. Contrary to previous approaches, we aim at finding a robust solution that is suitable for real-world…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.