Generalization Gap in Data Augmentation: Insights from Illumination

Jianqiang Xiao; Weiwen Guo; Junfeng Liu; Mengze Li

arXiv:2404.07514·cs.CV·August 22, 2024·2 cites

Generalization Gap in Data Augmentation: Insights from Illumination

Jianqiang Xiao, Weiwen Guo, Junfeng Liu, Mengze Li

PDF

Open Access

TL;DR

This paper investigates how data augmentation affects model generalization in computer vision, revealing that despite improvements, a significant gap remains due to differences in artificial and real-world visual features, especially illumination.

Contribution

It introduces the concept of visual representation variables and analyzes the impact of illumination-based data augmentation on model generalization, highlighting persistent gaps.

Findings

01

Data augmentation improves performance but does not eliminate the generalization gap.

02

Differences in artificial and natural illumination features affect model robustness.

03

Feature diversity in training data is crucial for better generalization.

Abstract

In the field of computer vision, data augmentation is widely used to enrich the feature complexity of training datasets with deep learning techniques. However, regarding the generalization capabilities of models, the difference in artificial features generated by data augmentation and natural visual features has not been fully revealed. This study introduces the concept of "visual representation variables" to define the possible visual variations in a task as a joint distribution of these variables. We focus on the visual representation variable "illumination", by simulating its distribution degradation and examining how data augmentation techniques enhance model performance on a classification task. Our goal is to investigate the differences in generalization between models trained with augmented data and those trained under real-world illumination conditions. Results indicate that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Advanced Vision and Imaging

MethodsSparse Evolutionary Training · Focus