TL;DR
This paper introduces Self-Balanced R-CNN, a novel instance segmentation model that addresses training imbalances and feature-level issues, achieving state-of-the-art results with fewer parameters and improved accuracy.
Contribution
It proposes new loop mechanisms, an improved RoI extraction, and a fully convolutional architecture head to enhance instance segmentation performance.
Findings
Achieves 45.3% AP for detection and 41.5% AP for segmentation on COCO.
Addresses IoU distribution imbalance and feature-level imbalance effectively.
Reduces model complexity while maintaining or improving accuracy.
Abstract
Current state-of-the-art two-stage models on instance segmentation task suffer from several types of imbalances. In this paper, we address the Intersection over the Union (IoU) distribution imbalance of positive input Regions of Interest (RoIs) during the training of the second stage. Our Self-Balanced R-CNN (SBR-CNN), an evolved version of the Hybrid Task Cascade (HTC) model, brings brand new loop mechanisms of bounding box and mask refinements. With an improved Generic RoI Extraction (GRoIE), we also address the feature-level imbalance at the Feature Pyramid Network (FPN) level, originated by a non-uniform integration between low- and high-level features from the backbone layers. In addition, the redesign of the architecture heads toward a fully convolutional approach with FCC further reduces the number of parameters and obtains more clues to the connection between the task to solve…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
