Delving into Sample Loss Curve to Embrace Noisy and Imbalanced Data
Shenwang Jiang, Jianan Li, Ying Wang, Bo Huang, Zhang Zhang, Tingfa Xu

TL;DR
This paper introduces a novel loss curve analysis method combined with a curve-perception network, CurveNet, to effectively handle noisy and imbalanced data simultaneously in deep neural network training.
Contribution
It proposes a new probe-and-allocate training strategy using loss curve characteristics and introduces SLMO to accelerate meta-learning, addressing a key challenge in biased data handling.
Findings
Achieves state-of-the-art performance on multiple benchmarks.
Effectively distinguishes biased sample types using loss curve trends.
Speeds up meta-learning with the proposed SLMO method.
Abstract
Corrupted labels and class imbalance are commonly encountered in practically collected training data, which easily leads to over-fitting of deep neural networks (DNNs). Existing approaches alleviate these issues by adopting a sample re-weighting strategy, which is to re-weight sample by designing weighting function. However, it is only applicable for training data containing only either one type of data biases. In practice, however, biased samples with corrupted labels and of tailed classes commonly co-exist in training data. How to handle them simultaneously is a key but under-explored problem. In this paper, we find that these two types of biased samples, though have similar transient loss, have distinguishable trend and characteristics in loss curves, which could provide valuable priors for sample weight assignment. Motivated by this, we delve into the loss curves and propose a novel…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
