PBRnet: Pyramidal Bounding Box Refinement to Improve Object Localization Accuracy
Li Xiao, Yufan Luo, Chunlong Luo, Lianhe Zhao, Quanshui Fu, Guoqing, Yang, Anpeng Huang, Yi Zhao

TL;DR
PBRnet is a novel boundary refinement architecture that enhances object localization accuracy by combining coarse-to-fine and feature pyramid strategies, significantly improving detection performance on MS-COCO.
Contribution
The paper introduces PBRnet, a boundary refinement network that leverages multi-resolution features to improve bounding box localization in object detection.
Findings
PBRnet improves mAP by approximately 3 points when added to FPN or Libra R-CNN.
Replacing the localization branch in Cascade R-CNN with PBRnet's regressor yields an additional 1.5 mAP improvement.
Overall, PBRnet achieves up to 5 points increase in mAP on MS-COCO.
Abstract
Many recently developed object detectors focused on coarse-to-fine framework which contains several stages that classify and regress proposals from coarse-grain to fine-grain, and obtains more accurate detection gradually. Multi-resolution models such as Feature Pyramid Network(FPN) integrate information of different levels of resolution and effectively improve the performance. Previous researches also have revealed that localization can be further improved by: 1) using fine-grained information which is more translational variant; 2) refining local areas which is more focused on local boundary information. Based on these principles, we designed a novel boundary refinement architecture to improve localization accuracy by combining coarse-to-fine framework with feature pyramid structure, named as Pyramidal Bounding Box Refinement network(PBRnet), which parameterizes gradually focused…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques
MethodsResidual Connection · Non-Local Operation · Non-Local Block · Embedded Gaussian Affinity · Balanced Feature Pyramid · IoU-Balanced Sampling · Balanced L1 Loss · Libra R-CNN · Cascade R-CNN · Max Pooling
