Adversarial Auto-Augment with Label Preservation: A Representation Learning Principle Guided Approach
Kaiwen Yang, Yanchao Sun, Jiahao Su, Fengxiang He, Xinmei Tian, Furong, Huang, Tianyi Zhou, Dacheng Tao

TL;DR
This paper introduces a label-preserving, prior-free data augmentation method that leverages intermediate representations of models to generate effective augmentations, improving performance across various learning tasks without needing domain knowledge.
Contribution
It proposes a novel augmentation approach based on representation learning principles that does not require pre-defined operations or generative models, applicable to multiple machine learning settings.
Findings
Consistently improves performance in supervised, semi-supervised, and noisy-label learning.
Effective in domains lacking prior knowledge, such as medical imaging.
Enhances existing augmentation techniques without additional generative models.
Abstract
Data augmentation is a critical contributing factor to the success of deep learning but heavily relies on prior domain knowledge which is not always available. Recent works on automatic data augmentation learn a policy to form a sequence of augmentation operations, which are still pre-defined and restricted to limited options. In this paper, we show that a prior-free autonomous data augmentation's objective can be derived from a representation learning principle that aims to preserve the minimum sufficient information of the labels. Given an example, the objective aims at creating a distant "hard positive example" as the augmentation, while still preserving the original label. We then propose a practical surrogate to the objective that can be optimized efficiently and integrated seamlessly into existing methods for a broad class of machine learning tasks, e.g., supervised,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis
