ReSmooth: Detecting and Utilizing OOD Samples when Training with Data   Augmentation

Chenyang Wang; Junjun Jiang; Xiong Zhou; Xianming Liu

arXiv:2205.12606·cs.CV·December 6, 2022

ReSmooth: Detecting and Utilizing OOD Samples when Training with Data Augmentation

Chenyang Wang, Junjun Jiang, Xiong Zhou, Xianming Liu

PDF

1 Repo

TL;DR

ReSmooth is a framework that detects out-of-distribution augmented samples during training and leverages them with different labels, improving the effectiveness of data augmentation techniques in deep neural network training.

Contribution

It introduces a method to identify and utilize OOD samples in augmented data, enhancing data augmentation strategies for better model performance.

Findings

01

ReSmooth improves classification accuracy across multiple benchmarks.

02

The framework effectively detects OOD samples using Gaussian mixture models.

03

ReSmooth enhances existing augmentation methods like RandAugment, rotate, and jigsaw.

Abstract

Data augmentation (DA) is a widely used technique for enhancing the training of deep neural networks. Recent DA techniques which achieve state-of-the-art performance always meet the need for diversity in augmented training samples. However, an augmentation strategy that has a high diversity usually introduces out-of-distribution (OOD) augmented samples and these samples consequently impair the performance. To alleviate this issue, we propose ReSmooth, a framework that firstly detects OOD samples in augmented samples and then leverages them. To be specific, we first use a Gaussian mixture model to fit the loss distribution of both the original and augmented samples and accordingly split these samples into in-distribution (ID) samples and OOD samples. Then we start a new training where ID and OOD samples are incorporated with different smooth labels. By treating ID samples and OOD samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

chenyang4/resmooth
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRandAugment