Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang; Chao Zhang; Hongyang Zhang

arXiv:2002.10319·cs.LG·October 1, 2020·88 cites

Self-Adaptive Training: beyond Empirical Risk Minimization

Lang Huang, Chao Zhang, Hongyang Zhang

PDF

Open Access 4 Repos 1 Video

TL;DR

Self-adaptive training is a novel algorithm that dynamically corrects training labels using model predictions, enhancing generalization and robustness against noisy data without extra computational cost.

Contribution

It introduces a self-adaptive training method that improves over ERM by correcting labels during training, reducing overfitting to noise and adversarial samples.

Findings

01

Significantly improves generalization under label noise.

02

Mitigates overfitting in natural and adversarial training.

03

Test error decreases monotonously with model capacity.

Abstract

We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially corrupted training data. This problem is crucial towards robustly learning from data that are corrupted by, e.g., label noises and out-of-distribution samples. The standard empirical risk minimization (ERM) for such data, however, may easily overfit noises and thus suffers from sub-optimal performance. In this paper, we observe that model predictions can substantially benefit the training process: self-adaptive training significantly improves generalization over ERM under various levels of noises, and mitigates the overfitting issue in both natural and adversarial training. We evaluate the error-capacity curve of self-adaptive training: the test error…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Self-Adaptive Training: beyond Empirical Risk Minimization· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms

MethodsTest · Self-adaptive Training