Learning from Training Dynamics: Identifying Mislabeled Data Beyond Manually Designed Features
Qingrui Jia, Xuhong Li, Lei Yu, Jiang Bian, Penghao Zhao, Shupeng Li,, Haoyi Xiong, Dejing Dou

TL;DR
This paper introduces a learning-based method using an LSTM noise detector to identify mislabeled samples in training data, outperforming existing techniques and aiding data debugging across multiple datasets.
Contribution
The paper presents a novel LSTM-based noise detection approach that learns from training dynamics and generalizes across datasets without retraining.
Findings
The method accurately detects mislabeled samples across various datasets.
It outperforms state-of-the-art mislabeled data detection techniques.
The approach facilitates effective data debugging and label correction.
Abstract
While mislabeled or ambiguously-labeled samples in the training set could negatively affect the performance of deep models, diagnosing the dataset and identifying mislabeled samples helps to improve the generalization power. Training dynamics, i.e., the traces left by iterations of optimization algorithms, have recently been proved to be effective to localize mislabeled samples with hand-crafted features. In this paper, beyond manually designed features, we introduce a novel learning-based solution, leveraging a noise detector, instanced by an LSTM network, which learns to predict whether a sample was mislabeled using the raw training dynamics as input. Specifically, the proposed method trains the noise detector in a supervised manner using the dataset with synthesized label noises and can adapt to various datasets (either naturally or synthesized label-noised) without retraining. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
MethodsTest · Sigmoid Activation · Tanh Activation · Long Short-Term Memory
