Cascade RPN: Delving into High-Quality Region Proposal Network with Adaptive Convolution
Thang Vu, Hyunjun Jang, Trung X. Pham, Chang D. Yoo

TL;DR
This paper introduces Cascade RPN, a multi-stage region proposal network with adaptive convolution that refines proposals more effectively, leading to significant improvements in region proposal quality and detection accuracy.
Contribution
It proposes a novel Cascade RPN architecture with single anchors and adaptive convolution for better alignment and refinement, surpassing existing methods in proposal quality.
Findings
Achieves 13.4 points higher AR than conventional RPN.
Improves detection mAP by 3.1 and 3.5 points with Fast R-CNN and Faster R-CNN.
Outperforms existing region proposal methods.
Abstract
This paper considers an architecture referred to as Cascade Region Proposal Network (Cascade RPN) for improving the region-proposal quality and detection performance by \textit{systematically} addressing the limitation of the conventional RPN that \textit{heuristically defines} the anchors and \textit{aligns} the features to the anchors. First, instead of using multiple anchors with predefined scales and aspect ratios, Cascade RPN relies on a \textit{single anchor} per location and performs multi-stage refinement. Each stage is progressively more stringent in defining positive samples by starting out with an anchor-free metric followed by anchor-based metrics in the ensuing stages. Second, to attain alignment between the features and the anchors throughout the stages, \textit{adaptive convolution} is proposed that takes the anchors in addition to the image features as its input and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsFaster R-CNN · Region Proposal Network · Softmax · Convolution · RoIPool · Fast R-CNN
