TL;DR
CenterMask introduces a real-time, anchor-free instance segmentation method with a novel spatial attention-guided mask, combined with improved backbone networks, achieving state-of-the-art accuracy and speed.
Contribution
The paper presents a new anchor-free segmentation approach with SAG-Mask and improved VoVNetV2 backbone, outperforming previous methods in accuracy and efficiency.
Findings
CenterMask achieves 38.3% AP with ResNet-101-FPN, surpassing previous methods.
CenterMask-Lite runs at over 35fps, outperforming state-of-the-art real-time methods.
The proposed VoVNetV2 backbone improves feature extraction for vision tasks.
Abstract
We propose a simple yet efficient anchor-free instance segmentation, called CenterMask, that adds a novel spatial attention-guided mask (SAG-Mask) branch to anchor-free one stage object detector (FCOS) in the same vein with Mask R-CNN. Plugged into the FCOS object detector, the SAG-Mask branch predicts a segmentation mask on each box with the spatial attention map that helps to focus on informative pixels and suppress noise. We also present an improved backbone networks, VoVNetV2, with two effective strategies: (1) residual connection for alleviating the optimization problem of larger VoVNet \cite{lee2019energy} and (2) effective Squeeze-Excitation (eSE) dealing with the channel information loss problem of original SE. With SAG-Mask and VoVNetV2, we deign CenterMask and CenterMask-Lite that are targeted to large and small models, respectively. Using the same ResNet-101-FPN backbone,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
CenterMask: Real-Time Anchor-Free Instance Segmentation· youtube
Taxonomy
MethodsRegion Proposal Network · One-Shot Aggregation · Non Maximum Suppression · Concatenated Skip Connection · Softmax · Average Pooling · ResNeXt Block · VoVNet · FCOS · OSA (identity mapping + eSE)
