Towards Understanding Why Data Augmentation Improves Generalization

Jingyang Li; Jiachun Pan; Kim-Chuan Toh; Pan Zhou

arXiv:2502.08940 · cs.CV · February 14, 2025

TL;DR

This paper proposes a unified theoretical framework explaining how data augmentation improves generalization through two effects: partial semantic feature removal, which promotes diverse feature learning, and feature mixing, which increases training complexity.

Contribution

It introduces a comprehensive theory that explains the mechanisms of data augmentation's benefits, unifying various techniques under a common understanding.

Findings

01 Data augmentation reduces reliance on single features.

02 Feature mixing increases training complexity and robustness.

03 Unified theory validated by experimental results.

Abstract

Data augmentation is a cornerstone technique in deep learning, widely used to improve model generalization. Traditional methods like random cropping and color jittering, as well as advanced techniques such as CutOut, Mixup, and CutMix, have achieved notable success across various domains. However, the mechanisms by which data augmentation improves generalization remain poorly understood, and existing theoretical analyses typically focus on individual techniques without a unified explanation. In this work, we present a unified theoretical framework that elucidates how data augmentation enhances generalization through two key effects: partial semantic feature removal and feature mixing. Partial semantic feature removal reduces the model's reliance on individual features, promoting diverse feature learning and better generalization. Feature mixing, by scaling down original semantic features…
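As a rough illustration of the two effects named in the abstract, the sketch below gives toy pixel-level versions of CutOut (removing part of an input) and Mixup (blending two inputs and their labels). It is not the paper's theoretical framework, which concerns semantic features. The function names `cutout` and `mixup` and the tiny 4×4 inputs are assumptions made for this example. Two simplifications apply: the erased patch is kept fully inside the image, and the mixing weight `lam` is passed in directly, whereas Mixup is usually described as sampling it from a Beta distribution.

```python
import random


def cutout(image, size, rng):
    """Return a copy of a 2D image (list of rows) with one size x size patch zeroed.

    Simplification: the patch is always placed fully inside the image.
    """
    height, width = len(image), len(image[0])
    top = rng.randrange(0, height - size + 1)
    left = rng.randrange(0, width - size + 1)
    out = [row[:] for row in image]  # copy so the input is not modified
    for r in range(top, top + size):
        for c in range(left, left + size):
            out[r][c] = 0.0
    return out


def mixup(image_a, image_b, label_a, label_b, lam):
    """Blend two images and their one-hot labels with weight lam in [0, 1]."""
    mixed_image = [
        [lam * a + (1.0 - lam) * b for a, b in zip(row_a, row_b)]
        for row_a, row_b in zip(image_a, image_b)
    ]
    mixed_label = [lam * a + (1.0 - lam) * b for a, b in zip(label_a, label_b)]
    return mixed_image, mixed_label


# Toy inputs (hypothetical): an all-ones image of class 0, an all-zeros image of class 1.
rng = random.Random(0)
image_a = [[1.0] * 4 for _ in range(4)]
image_b = [[0.0] * 4 for _ in range(4)]

removed = cutout(image_a, size=2, rng=rng)
mixed_image, mixed_label = mixup(image_a, image_b, [1.0, 0.0], [0.0, 1.0], lam=0.7)
```

In this toy setting, `removed` loses a 2×2 block of the original signal, while `mixed_image` keeps every pixel of `image_a` but scaled down by `lam`, which mirrors the abstract's phrase "scaling down original semantic features".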

Taxonomy

Topics: Image Retrieval and Classification Techniques · Big Data Technologies and Applications · Data Analysis with R

Methods: Mixup · CutMix · Focus