Meta Approach to Data Augmentation Optimization

Ryuichiro Hataya; Jan Zdenek; Kazuki Yoshizoe; Hideki Nakayama

arXiv:2006.07965·cs.CV·June 16, 2020·6 cites

Meta Approach to Data Augmentation Optimization

Ryuichiro Hataya, Jan Zdenek, Kazuki Yoshizoe, Hideki Nakayama

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel method for jointly optimizing image recognition models and data augmentation policies directly via gradient descent, enhancing performance without proxy tasks or extensive hyperparameter tuning.

Contribution

It presents a scalable, efficient approach that approximates policy gradients using implicit gradients and Neumann series, avoiding prior search space reductions.

Findings

01

Improves ImageNet classification accuracy

02

Enhances fine-grained recognition performance

03

Eliminates need for dataset-specific hyperparameter tuning

Abstract

Data augmentation policies drastically improve the performance of image recognition tasks, especially when the policies are optimized for the target data and tasks. In this paper, we propose to optimize image recognition models and data augmentation policies simultaneously to improve the performance using gradient descent. Unlike prior methods, our approach avoids using proxy tasks or reducing search space, and can directly improve the validation performance. Our method achieves efficient and scalable training by approximating the gradient of policies by implicit gradient with Neumann series approximation. We demonstrate that our approach can improve the performance of various image classification tasks, including ImageNet classification and fine-grained recognition, without using dataset-specific hyperparameter tuning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Meta Approach to Data Augmentation Optimization· youtube

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Stochastic Gradient Optimization Techniques