MixUp as Locally Linear Out-Of-Manifold Regularization
Hongyu Guo, Yongyi Mao, Richong Zhang

TL;DR
This paper analyzes MixUp as a form of out-of-manifold regularization, identifies a limitation called manifold intrusion, and proposes AdaMixUp, an adaptive version that learns mixing policies to improve deep neural network training.
Contribution
It provides a theoretical understanding of MixUp, introduces the concept of manifold intrusion, and proposes AdaMixUp, an adaptive regularizer that enhances MixUp's effectiveness.
Findings
AdaMixUp outperforms standard MixUp on benchmark datasets.
Manifold intrusion can cause under-fitting in MixUp.
Adaptive learning of mixing policies mitigates manifold intrusion.
Abstract
MixUp is a recently proposed data-augmentation scheme, which linearly interpolates a random pair of training examples and correspondingly the one-hot representations of their labels. Training deep neural networks with such additional data is shown capable of significantly improving the predictive accuracy of the current art. The power of MixUp, however, is primarily established empirically and its working and effectiveness have not been explained in any depth. In this paper, we develop an understanding for MixUp as a form of "out-of-manifold regularization", which imposes certain "local linearity" constraints on the model's input space beyond the data manifold. This analysis enables us to identify a limitation of MixUp, which we call "manifold intrusion". In a nutshell, manifold intrusion in MixUp is a form of under-fitting resulting from conflicts between the synthetic labels of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Human Pose and Action Recognition
MethodsMixup
