Learning to Rectify for Robust Learning with Noisy Labels
Haoliang Sun, Chenhui Guo, Qi Wei, Zhongyi Han, Yilong Yin

TL;DR
This paper introduces WarPI, a hierarchical probabilistic meta-learning approach that adaptively rectifies training in noisy label scenarios, significantly improving robustness and generalization of deep classifiers.
Contribution
WarPI is a novel hierarchical probabilistic model with a learned meta-network that adaptively rectifies training, outperforming existing methods under various label noise conditions.
Findings
Achieves state-of-the-art results on four noisy label benchmarks.
Effectively resolves sample ambiguity and improves robustness.
Seamlessly integrates into SGD optimization for deep networks.
Abstract
Label noise significantly degrades the generalization ability of deep models in applications. Effective strategies and approaches, \textit{e.g.} re-weighting, or loss correction, are designed to alleviate the negative impact of label noise when training a neural network. Those existing works usually rely on the pre-specified architecture and manually tuning the additional hyper-parameters. In this paper, we propose warped probabilistic inference (WarPI) to achieve adaptively rectifying the training procedure for the classification network within the meta-learning scenario. In contrast to the deterministic models, WarPI is formulated as a hierarchical probabilistic model by learning an amortization meta-network, which can resolve sample ambiguity and be therefore more robust to serious label noise. Unlike the existing approximated weighting function of directly generating weight values…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications
MethodsStochastic Gradient Descent
