Reweighting Augmented Samples by Minimizing the Maximal Expected Loss

Mingyang Yi; Lu Hou; Lifeng Shang; Xin Jiang; Qun Liu; Zhi-Ming Ma

arXiv:2103.08933·cs.LG·March 17, 2021·1 cites

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss

Mingyang Yi, Lu Hou, Lifeng Shang, Xin Jiang, Qun Liu, Zhi-Ming Ma

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a novel reweighting strategy for augmented samples in deep learning, focusing on minimizing the worst-case expected loss to improve model generalization across various data augmentation techniques.

Contribution

It proposes a new loss minimization approach inspired by adversarial training that assigns higher weights to harder augmented samples, enhancing generalization.

Findings

01

Improves generalization on natural language understanding tasks.

02

Enhances performance in image classification with common augmentations.

03

Applicable on top of existing data augmentation methods.

Abstract

Data augmentation is an effective technique to improve the generalization of deep neural networks. However, previous data augmentation methods usually treat the augmented samples equally without considering their individual impacts on the model. To address this, for the augmented samples from the same training example, we propose to assign different weights to them. We construct the maximal expected loss which is the supremum over any reweighted loss on augmented samples. Inspired by adversarial training, we minimize this maximal expected loss (MMEL) and obtain a simple and interpretable closed-form solution: more attention should be paid to augmented samples with large loss values (i.e., harder examples). Minimizing this maximal expected loss enables the model to perform well under any reweighting strategy. The proposed method can generally be applied on top of any data augmentation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yookyungkho/mav
pytorch

Videos

Reweighting Augmented Samples by Minimizing the Maximal Expected Loss· slideslive

Taxonomy

TopicsImage and Signal Denoising Methods · Advanced Data Compression Techniques · Generative Adversarial Networks and Image Synthesis