TensorMask: A Foundation for Dense Object Segmentation
Xinlei Chen, Ross Girshick, Kaiming He, Piotr Doll\'ar

TL;DR
TensorMask introduces a novel dense sliding-window approach for instance segmentation that models geometric structures explicitly using 4D tensors, achieving results comparable to Mask R-CNN and opening new research avenues.
Contribution
The paper formalizes dense instance segmentation as a 4D tensor prediction task and presents TensorMask, a framework that captures geometric structure for improved dense mask prediction.
Findings
TensorMask outperforms baseline methods ignoring tensor structure.
Results are comparable to Mask R-CNN in dense instance segmentation.
Explicit geometric modeling via 4D tensors enhances segmentation performance.
Abstract
Sliding-window object detectors that generate bounding-box object predictions over a dense, regular grid have advanced rapidly and proven popular. In contrast, modern instance segmentation approaches are dominated by methods that first detect object bounding boxes, and then crop and segment these regions, as popularized by Mask R-CNN. In this work, we investigate the paradigm of dense sliding-window instance segmentation, which is surprisingly under-explored. Our core observation is that this task is fundamentally different than other dense prediction tasks such as semantic segmentation or bounding-box object detection, as the output at every spatial location is itself a geometric structure with its own spatial dimensions. To formalize this, we treat dense instance segmentation as a prediction task over 4D tensors and present a general framework called TensorMask that explicitly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Adversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning
MethodsRegion Proposal Network · Softmax · RoIAlign · Average Pooling · Mask R-CNN · Residual Connection · *Communicated@Fast*How Do I Communicate to Expedia? · 1x1 Convolution · Batch Normalization · Bottleneck Residual Block
