Learning Tree-Structured Composition of Data Augmentation
Dongyue Li, Kailai Chen, Predrag Radivojac, and Hongyang R. Zhang

TL;DR
This paper introduces a fast, tree-structured data augmentation search algorithm that improves efficiency and effectiveness in neural network training, especially for heterogeneous data distributions, by reducing computational cost and enhancing performance.
Contribution
The paper presents a novel binary tree-structured search algorithm for data augmentation, significantly faster than existing methods, and demonstrates its effectiveness on graph and image datasets.
Findings
Reduces search computation cost by 43%
Improves augmentation performance by 4.3%
Provides interpretable tree structures for transformation importance
Abstract
Data augmentation is widely used for training a neural network given little labeled data. A common practice of augmentation training is applying a composition of multiple transformations sequentially to the data. Existing augmentation methods such as RandAugment randomly sample from a list of pre-selected transformations, while methods such as AutoAugment apply advanced search to optimize over an augmentation set of size , which is the number of transformation sequences of length , given a list of transformations. In this paper, we design efficient algorithms whose running time complexity is much faster than the worst-case complexity of , provably. We propose a new algorithm to search for a binary tree-structured composition of transformations, where each tree node corresponds to one transformation. The binary tree generalizes sequential augmentations, such as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Data Classification
Methods*Communicated@Fast*How Do I Communicate to Expedia? · Bitcoin Customer Service Number +1-833-534-1729 · Sparse Evolutionary Training · Dense Connections · Tanh Activation · Kaiming Initialization · Max Pooling · Sigmoid Activation · Convolution · Average Pooling
