MixUp as Locally Linear Out-Of-Manifold Regularization

Hongyu Guo; Yongyi Mao; Richong Zhang

arXiv:1809.02499·cs.LG·November 26, 2018·6 cites

MixUp as Locally Linear Out-Of-Manifold Regularization

Hongyu Guo, Yongyi Mao, Richong Zhang

PDF

Open Access 2 Repos

TL;DR

This paper analyzes MixUp as a form of out-of-manifold regularization, identifies a limitation called manifold intrusion, and proposes AdaMixUp, an adaptive version that learns mixing policies to improve deep neural network training.

Contribution

It provides a theoretical understanding of MixUp, introduces the concept of manifold intrusion, and proposes AdaMixUp, an adaptive regularizer that enhances MixUp's effectiveness.

Findings

01

AdaMixUp outperforms standard MixUp on benchmark datasets.

02

Manifold intrusion can cause under-fitting in MixUp.

03

Adaptive learning of mixing policies mitigates manifold intrusion.

Abstract

MixUp is a recently proposed data-augmentation scheme, which linearly interpolates a random pair of training examples and correspondingly the one-hot representations of their labels. Training deep neural networks with such additional data is shown capable of significantly improving the predictive accuracy of the current art. The power of MixUp, however, is primarily established empirically and its working and effectiveness have not been explained in any depth. In this paper, we develop an understanding for MixUp as a form of "out-of-manifold regularization", which imposes certain "local linearity" constraints on the model's input space beyond the data manifold. This analysis enables us to identify a limitation of MixUp, which we call "manifold intrusion". In a nutshell, manifold intrusion in MixUp is a form of under-fitting resulting from conflicts between the synthetic labels of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Machine Learning and Data Classification · Human Pose and Action Recognition

MethodsMixup