GridMask Data Augmentation
Pengguang Chen, Shu Liu, Hengshuang Zhao, Xingquan Wang, Jiaya Jia

TL;DR
GridMask is a new data augmentation technique that systematically removes regions of input images, leading to improved performance across various computer vision tasks with less computational cost than existing methods.
Contribution
The paper introduces GridMask, a simple yet effective structured information removal method that outperforms state-of-the-art augmentation techniques like AutoAugment.
Findings
Outperforms AutoAugment on multiple datasets
Improves recognition, detection, and segmentation performance
Less computationally expensive than reinforcement learning-based methods
Abstract
We propose a novel data augmentation method `GridMask' in this paper. It utilizes information removal to achieve state-of-the-art results in a variety of computer vision tasks. We analyze the requirement of information dropping. Then we show limitation of existing information dropping algorithms and propose our structured method, which is simple and yet very effective. It is based on the deletion of regions of the input image. Our extensive experiments show that our method outperforms the latest AutoAugment, which is way more computationally expensive due to the use of reinforcement learning to find the best policies. On the ImageNet dataset for recognition, COCO2017 object detection, and on Cityscapes dataset for semantic segmentation, our method all notably improves performance over baselines. The extensive experiments manifest the effectiveness and generality of the new method.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning
MethodsGridMask · Sigmoid Activation · Tanh Activation · Long Short-Term Memory · AutoAugment
