EmbedMask: Embedding Coupling for One-stage Instance Segmentation
Hui Ying, Zhaojin Huang, Shu Liu, Tianjia Shao, Kun Zhou

TL;DR
EmbedMask is a one-stage instance segmentation method that combines detection and embedding techniques to produce high-resolution masks efficiently, outperforming traditional segmentation-based methods.
Contribution
It introduces a unified one-stage approach that couples pixel and proposal embeddings, enabling detailed mask generation with high speed and performance.
Findings
Achieves comparable performance to Mask R-CNN.
Produces more detailed masks at higher speed.
Utilizes embedding coupling for effective pixel clustering.
Abstract
Current instance segmentation methods can be categorized into segmentation-based methods that segment first then do clustering, and proposal-based methods that detect first then predict masks for each instance proposal using repooling. In this work, we propose a one-stage method, named EmbedMask, that unifies both methods by taking advantages of them. Like proposal-based methods, EmbedMask builds on top of detection models making it strong in detection capability. Meanwhile, EmbedMask applies extra embedding modules to generate embeddings for pixels and proposals, where pixel embeddings are guided by proposal embeddings if they belong to the same instance. Through this embedding coupling process, pixels are assigned to the mask of the proposal if their embeddings are similar. The pixel-level clustering enables EmbedMask to generate high-resolution masks without missing details from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Region Proposal Network · Softmax · RoIAlign · Average Pooling · Mask R-CNN · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization
