Cascade R-CNN: High Quality Object Detection and Instance Segmentation
Zhaowei Cai, Nuno Vasconcelos

TL;DR
Cascade R-CNN introduces a multi-stage detection architecture trained with increasing IoU thresholds, significantly improving high-quality object detection and instance segmentation performance across multiple datasets.
Contribution
It proposes a novel multi-stage cascade framework that addresses overfitting and quality mismatch issues in object detection, achieving state-of-the-art results.
Findings
State-of-the-art performance on COCO dataset.
Significant improvements in high-quality detection on various datasets.
Enhanced instance segmentation with Cascade R-CNN.
Abstract
In object detection, the intersection over union (IoU) threshold is frequently used to define positives/negatives. The threshold used to train a detector defines its \textit{quality}. While the commonly used threshold of 0.5 leads to noisy (low-quality) detections, detection performance frequently degrades for larger thresholds. This paradox of high-quality detection has two causes: 1) overfitting, due to vanishing positive samples for large thresholds, and 2) inference-time quality mismatch between detector and test hypotheses. A multi-stage object detection architecture, the Cascade R-CNN, composed of a sequence of detectors trained with increasing IoU thresholds, is proposed to address these problems. The detectors are trained sequentially, using the output of a detector as training set for the next. This resampling progressively improves hypotheses quality, guaranteeing a positive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsRegion Proposal Network · Softmax · RoIAlign · Mask R-CNN · Cascade R-CNN · Convolution
