SODAR: Segmenting Objects by DynamicallyAggregating Neighboring Mask Representations
Tao Wang, Jun Hao Liew, Yu Li, Yunpeng Chen, Jiashi Feng

TL;DR
SODAR enhances one-stage instance segmentation by aggregating neighboring mask representations through a learned, efficient method, significantly improving performance with minimal additional computation.
Contribution
The paper introduces SODAR, a novel aggregation approach that leverages neighboring mask information to improve SOLO-based instance segmentation.
Findings
SODAR outperforms SOLO with ResNet-101 by 2.2 AP on COCO test set.
The aggregation method reduces computation by sharing mask representations.
SODAR achieves consistent gains with SOLOv2.
Abstract
Recent state-of-the-art one-stage instance segmentation model SOLO divides the input image into a grid and directly predicts per grid cell object masks with fully-convolutional networks, yielding comparably good performance as traditional two-stage Mask R-CNN yet enjoying much simpler architecture and higher efficiency. We observe SOLO generates similar masks for an object at nearby grid cells, and these neighboring predictions can complement each other as some may better segment certain object part, most of which are however directly discarded by non-maximum-suppression. Motivated by the observed gap, we develop a novel learning-based aggregation method that improves upon SOLO by leveraging the rich neighboring information while maintaining the architectural efficiency. The resulting model is named SODAR. Unlike the original per grid cell object masks, SODAR is implicitly supervised to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Medical Image Segmentation Techniques · Medical Imaging and Analysis
MethodsRegion Proposal Network · Softmax · RoIAlign · Convolution · Mask R-CNN
