TL;DR
This paper explores a generalized form of mixed-example data augmentation, revealing a broader space of effective techniques that outperform previous methods and challenge existing theories on why such augmentations work.
Contribution
It introduces a generalized framework for mixed-example data augmentation, expanding the set of effective techniques beyond prior approaches and providing insights into their underlying mechanisms.
Findings
Broader space of mixed-example augmentation techniques identified.
Some new methods outperform previous state-of-the-art.
Current theories on augmentation effectiveness are incomplete.
Abstract
In order to reduce overfitting, neural networks are typically trained with data augmentation, the practice of artificially generating additional training data via label-preserving transformations of existing training examples. While these types of transformations make intuitive sense, recent work has demonstrated that even non-label-preserving data augmentation can be surprisingly effective, examining this type of data augmentation through linear combinations of pairs of examples. Despite their effectiveness, little is known about why such methods work. In this work, we aim to explore a new, more generalized form of this type of data augmentation in order to determine whether such linearity is necessary. By considering this broader scope of "mixed-example data augmentation", we find a much larger space of practical augmentation techniques, including methods that improve upon previous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
