TL;DR
This paper introduces the Edge Agreement Head, an auxiliary component for Mask R-CNN that leverages edge information to accelerate training and improve segmentation accuracy without affecting inference speed.
Contribution
The novel Edge Agreement Head encourages masks to match image gradients, leading to faster training and better performance in instance segmentation tasks.
Findings
8.1% improvement on MS COCO metrics
Training speed-up with negligible inference impact
Effective use of edge information during training
Abstract
We present an auxiliary task to Mask R-CNN, an instance segmentation network, which leads to faster training of the mask head. Our addition to Mask R-CNN is a new prediction head, the Edge Agreement Head, which is inspired by the way human annotators perform instance segmentation. Human annotators copy the contour of an object instance and only indirectly the occupied instance area. Hence, the edges of instance masks are particularly useful as they characterize the instance well. The Edge Agreement Head therefore encourages predicted masks to have similar image gradients to the ground-truth mask using edge detection filters. We provide a detailed survey of loss combinations and show improvements on the MS COCO Mask metrics compared to using no additional loss. Our approach marginally increases the model size and adds no additional trainable model variables. While the computational costs…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings · Region Proposal Network · Softmax · Convolution · RoIAlign · Mask R-CNN
