ShapeMask: Learning to Segment Novel Objects by Refining Shape Priors
Weicheng Kuo, Anelia Angelova, Jitendra Malik, Tsung-Yi Lin

TL;DR
ShapeMask introduces a shape prior-based approach for instance segmentation that generalizes well to novel categories, outperforming existing methods and maintaining efficiency and robustness.
Contribution
It proposes a novel shape prior-based method for instance segmentation that improves generalization to unseen categories and enhances robustness.
Findings
Outperforms state-of-the-art by 6.4 and 3.8 AP across categories
Robust to detection errors and limited training data
Runs efficiently with 150ms inference time
Abstract
Instance segmentation aims to detect and segment individual objects in a scene. Most existing methods rely on precise mask annotations of every category. However, it is difficult and costly to segment objects in novel categories because a large number of mask annotations is required. We introduce ShapeMask, which learns the intermediate concept of object shape to address the problem of generalization in instance segmentation to novel categories. ShapeMask starts with a bounding box detection and gradually refines it by first estimating the shape of the detected object through a collection of shape priors. Next, ShapeMask refines the coarse shape into an instance level mask by learning instance embeddings. The shape priors provide a strong cue for object-like prediction, and the instance embeddings model the instance specific appearance information. ShapeMask significantly outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications
MethodsSigmoid Activation · Entropy Regularization · Tanh Activation · Proximal Policy Optimization · Average Pooling · Softmax · Long Short-Term Memory · Neural Architecture Search · Batch Normalization · Convolution
