Learning from Data with Noisy Labels Using Temporal Self-Ensemble
Jun Ho Lee, Jae Soon Baik, Tae Hwan Hwang, and Jun Won Choi

TL;DR
This paper introduces SRT, a robust training method for deep neural networks that effectively handles noisy labels by using temporal self-ensembles and multi-view predictions, eliminating the need for dual networks.
Contribution
The paper proposes a novel single-network robust training scheme using temporal self-ensembles and multi-view agreement to identify and filter noisy labels.
Findings
Achieves state-of-the-art performance on public datasets.
Reduces computational resources compared to dual-network methods.
Effectively filters noisy labels during training.
Abstract
There are inevitably many mislabeled data in real-world datasets. Because deep neural networks (DNNs) have an enormous capacity to memorize noisy labels, a robust training scheme is required to prevent labeling errors from degrading the generalization performance of DNNs. Current state-of-the-art methods present a co-training scheme that trains dual networks using samples associated with small losses. In practice, however, training two networks simultaneously can burden computing resources. In this study, we propose a simple yet effective robust training scheme that operates by training only a single network. During training, the proposed method generates temporal self-ensemble by sampling intermediate network parameters from the weight trajectory formed by stochastic gradient descent optimization. The loss sum evaluated with these self-ensembles is used to identify incorrectly labeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMusic and Audio Processing · Machine Learning and Data Classification · Video Surveillance and Tracking Methods
