Augmentation Pathways Network for Visual Recognition

Yalong Bai; Mohan Zhou; Wei Zhang; Bowen Zhou; Tao Mei

arXiv:2107.11990·cs.CV·March 17, 2023·1 cites

Augmentation Pathways Network for Visual Recognition

Yalong Bai, Mohan Zhou, Wei Zhang, Bowen Zhou, Tao Mei

PDF

Open Access 1 Repo

TL;DR

This paper proposes Augmentation Pathways, a novel neural network architecture that stabilizes training with diverse heavy data augmentations, improving visual recognition performance without increasing inference costs.

Contribution

Introduction of Augmentation Pathways (AP), a new network design that enables stable training with a wide range of augmentation policies, including heavy augmentations, by processing images through multiple neural paths.

Findings

01

AP improves robustness to heavy augmentations.

02

Experimental results show better performance on ImageNet.

03

AP requires fewer parameters and less computation at inference.

Abstract

Data augmentation is practically helpful for visual recognition, especially at the time of data scarcity. However, such success is only limited to quite a few light augmentations (e.g., random crop, flip). Heavy augmentations are either unstable or show adverse effects during training, owing to the big gap between the original and augmented images. This paper introduces a novel network design, noted as Augmentation Pathways (AP), to systematically stabilize training on a much wider range of augmentation policies. Notably, AP tames various heavy data augmentations and stably boosts performance without a careful selection among augmentation policies. Unlike traditional single pathway, augmented images are processed in different neural paths. The main pathway handles the light augmentations, while other pathways focus on the heavier augmentations. By interacting with multiple paths in a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ap-conv/ap-net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Image Enhancement Techniques · Visual Attention and Saliency Detection