TL;DR
This paper demonstrates that a simple random copy-paste data augmentation method significantly improves instance segmentation performance, outperforming previous complex approaches and achieving state-of-the-art results on COCO and LVIS benchmarks.
Contribution
The study shows that a straightforward copy-paste augmentation without complex context modeling is highly effective for instance segmentation, enhancing performance over prior methods.
Findings
Achieves 49.1 mask AP on COCO, surpassing previous state-of-the-art.
Outperforms LVIS 2020 Challenge winner by +3.6 mask AP on rare categories.
Copy-Paste augmentation is additive with semi-supervised methods.
Abstract
Building instance segmentation models that are data-efficient and can handle rare object categories is an important challenge in computer vision. Leveraging data augmentations is a promising direction towards addressing this challenge. Here, we perform a systematic study of the Copy-Paste augmentation ([13, 12]) for instance segmentation where we randomly paste objects onto an image. Prior studies on Copy-Paste relied on modeling the surrounding visual context for pasting the objects. However, we find that the simple mechanism of pasting objects randomly is good enough and can provide solid gains on top of strong baselines. Furthermore, we show Copy-Paste is additive with semi-supervised methods that leverage extra data through pseudo labeling (e.g. self-training). On COCO instance segmentation, we achieve 49.1 mask AP and 57.3 box AP, an improvement of +0.6 mask AP and +1.5 box AP over…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsRegion Proposal Network · Pointwise Convolution · Depthwise Convolution · Depthwise Separable Convolution · Entropy Regularization · Sigmoid Activation · Proximal Policy Optimization · Residual Connection · Tanh Activation · Long Short-Term Memory
