Self-Adaptive Training: beyond Empirical Risk Minimization
Lang Huang, Chao Zhang, Hongyang Zhang

TL;DR
Self-adaptive training is a novel algorithm that dynamically corrects training labels using model predictions, enhancing generalization and robustness against noisy data without extra computational cost.
Contribution
It introduces a self-adaptive training method that improves over ERM by correcting labels during training, reducing overfitting to noise and adversarial samples.
Findings
Significantly improves generalization under label noise.
Mitigates overfitting in natural and adversarial training.
Test error decreases monotonously with model capacity.
Abstract
We propose self-adaptive training---a new training algorithm that dynamically corrects problematic training labels by model predictions without incurring extra computational cost---to improve generalization of deep learning for potentially corrupted training data. This problem is crucial towards robustly learning from data that are corrupted by, e.g., label noises and out-of-distribution samples. The standard empirical risk minimization (ERM) for such data, however, may easily overfit noises and thus suffers from sub-optimal performance. In this paper, we observe that model predictions can substantially benefit the training process: self-adaptive training significantly improves generalization over ERM under various levels of noises, and mitigates the overfitting issue in both natural and adversarial training. We evaluate the error-capacity curve of self-adaptive training: the test error…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Adversarial Robustness in Machine Learning · Machine Learning and Algorithms
MethodsTest · Self-adaptive Training
