Recursively Refined R-CNN: Instance Segmentation with Self-RoI   Rebalancing

Leonardo Rossi; Akbar Karimi; Andrea Prati

arXiv:2104.01329·cs.CV·June 22, 2022

Recursively Refined R-CNN: Instance Segmentation with Self-RoI Rebalancing

Leonardo Rossi, Akbar Karimi, Andrea Prati

PDF

1 Repo

TL;DR

The paper introduces R^3-CNN, a recursive instance segmentation model that improves accuracy and reduces complexity by replacing cascade architectures with a loop mechanism and recursive re-sampling, outperforming recent models.

Contribution

It proposes a novel recursive re-sampling and loop mechanism in R^3-CNN that enhances instance segmentation performance while reducing model complexity.

Findings

01

R^3-CNN surpasses HTC in accuracy on COCO dataset.

02

It achieves performance improvements independently of the baseline model.

03

The architecture significantly reduces the number of parameters.

Abstract

Within the field of instance segmentation, most of the state-of-the-art deep learning networks rely nowadays on cascade architectures, where multiple object detectors are trained sequentially, re-sampling the ground truth at each step. This offers a solution to the problem of exponentially vanishing positive samples. However, it also translates into an increase in network complexity in terms of the number of parameters. To address this issue, we propose Recursively Refined R-CNN (R^3-CNN) which avoids duplicates by introducing a loop mechanism instead. At the same time, it achieves a quality boost using a recursive re-sampling technique, where a specific IoU quality is utilized in each recursion to eventually equally cover the positive spectrum. Our experiments highlight the specific encoding of the loop mechanism in the weights, requiring its usage at inference time. The R^3-CNN…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IMPLabUniPr/mmdetection
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsNon-Local Operation · Residual Connection · Non-Local Block · Generic RoI Extractor · 1x1 Convolution · Convolution