Reweighting Augmented Samples by Minimizing the Maximal Expected Loss
Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

TL;DR
This paper introduces a novel reweighting strategy for augmented samples in deep learning, focusing on minimizing the worst-case expected loss to improve model generalization across various data augmentation techniques.
Contribution
It proposes a new loss minimization approach inspired by adversarial training that assigns higher weights to harder augmented samples, enhancing generalization.
Findings
Improves generalization on natural language understanding tasks.
Enhances performance in image classification with common augmentations.
Applicable on top of existing data augmentation methods.
Abstract
Data augmentation is an effective technique to improve the generalization of deep neural networks. However, previous data augmentation methods usually treat the augmented samples equally without considering their individual impacts on the model. To address this, for the augmented samples from the same training example, we propose to assign different weights to them. We construct the maximal expected loss which is the supremum over any reweighted loss on augmented samples. Inspired by adversarial training, we minimize this maximal expected loss (MMEL) and obtain a simple and interpretable closed-form solution: more attention should be paid to augmented samples with large loss values (i.e., harder examples). Minimizing this maximal expected loss enables the model to perform well under any reweighting strategy. The proposed method can generally be applied on top of any data augmentation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsImage and Signal Denoising Methods · Advanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis
