Augmentation Pathways Network for Visual Recognition
Yalong Bai, Mohan Zhou, Wei Zhang, Bowen Zhou, Tao Mei

TL;DR
This paper proposes Augmentation Pathways, a novel neural network architecture that stabilizes training with diverse heavy data augmentations, improving visual recognition performance without increasing inference costs.
Contribution
Introduction of Augmentation Pathways (AP), a new network design that enables stable training with a wide range of augmentation policies, including heavy augmentations, by processing images through multiple neural paths.
Findings
AP improves robustness to heavy augmentations.
Experimental results show better performance on ImageNet.
AP requires fewer parameters and less computation at inference.
Abstract
Data augmentation is practically helpful for visual recognition, especially at the time of data scarcity. However, such success is only limited to quite a few light augmentations (e.g., random crop, flip). Heavy augmentations are either unstable or show adverse effects during training, owing to the big gap between the original and augmented images. This paper introduces a novel network design, noted as Augmentation Pathways (AP), to systematically stabilize training on a much wider range of augmentation policies. Notably, AP tames various heavy data augmentations and stably boosts performance without a careful selection among augmentation policies. Unlike traditional single pathway, augmented images are processed in different neural paths. The main pathway handles the light augmentations, while other pathways focus on the heavier augmentations. By interacting with multiple paths in a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Visual Attention and Saliency Detection
