Simple Copy-Paste is a Strong Data Augmentation Method for Instance   Segmentation

Golnaz Ghiasi; Yin Cui; Aravind Srinivas; Rui Qian; Tsung-Yi Lin; Ekin; D. Cubuk; Quoc V. Le; Barret Zoph

arXiv:2012.07177·cs.CV·June 24, 2021

Simple Copy-Paste is a Strong Data Augmentation Method for Instance Segmentation

Golnaz Ghiasi, Yin Cui, Aravind Srinivas, Rui Qian, Tsung-Yi Lin, Ekin, D. Cubuk, Quoc V. Le, Barret Zoph

PDF

5 Repos

TL;DR

This paper demonstrates that a simple random copy-paste data augmentation method significantly improves instance segmentation performance, outperforming previous complex approaches and achieving state-of-the-art results on COCO and LVIS benchmarks.

Contribution

The study shows that a straightforward copy-paste augmentation without complex context modeling is highly effective for instance segmentation, enhancing performance over prior methods.

Findings

01

Achieves 49.1 mask AP on COCO, surpassing previous state-of-the-art.

02

Outperforms LVIS 2020 Challenge winner by +3.6 mask AP on rare categories.

03

Copy-Paste augmentation is additive with semi-supervised methods.

Abstract

Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies on Copy-Paste relied on modeling the surrounding visual context for pasting the objects. However, we find that the simple mechanism of pasting objects randomly is good enough and can provide solid gains on top of strong baselines. Furthermore, we show Copy-Paste is additive with semi-supervised methods that leverage extra data through pseudo labeling (e.g. self-training). On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an improvement of +0.6 mask AP and +1.5 box AP over…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsRegion Proposal Network · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Entropy Regularization · Sigmoid Activation · Proximal Policy Optimization · Residual Connection · Tanh Activation · Long Short-Term Memory