Improving Deep Learning using Generic Data Augmentation

Luke Taylor; Geoff Nitschke

arXiv:1708.06020·cs.LG·August 22, 2017·159 cites

Improving Deep Learning using Generic Data Augmentation

Luke Taylor, Geoff Nitschke

PDF

Open Access

TL;DR

This paper benchmarks various generic data augmentation techniques for CNNs, demonstrating that cropping significantly improves task performance, thereby guiding researchers in selecting effective augmentation methods.

Contribution

It provides a comparative analysis of popular data augmentation schemes, highlighting the effectiveness of cropping in geometric augmentation for CNN training.

Findings

01

Cropping in geometric augmentation improves CNN accuracy.

02

Different augmentation schemes have varying impacts on performance.

03

Benchmark results guide optimal augmentation choices for CNNs.

Abstract

Deep artificial neural networks require a large corpus of training data in order to effectively learn, where collection of such training data is often expensive and laborious. Data augmentation overcomes this issue by artificially inflating the training set with label preserving transformations. Recently there has been extensive use of generic data augmentation to improve Convolutional Neural Network (CNN) task performance. This study benchmarks various popular data augmentation schemes to allow researchers to make informed decisions as to which training methods are most appropriate for their data sets. Various geometric and photometric schemes are evaluated on a coarse-grained data set using a relatively simple CNN. Experimental results, run using 4-fold cross-validation and reported in terms of Top-1 and Top-5 accuracy, indicate that cropping in geometric augmentation significantly…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Advanced Neural Network Applications