TL;DR
This paper introduces PRB-FPN, a novel feature pyramid network with bi-directional fusion and residual modules, significantly improving accuracy in single-shot object detection across various object sizes.
Contribution
The paper proposes a parallel bi-directional feature pyramid with residual and purification modules, enhancing localization and detection accuracy for small and large objects.
Findings
Achieves state-of-the-art results on UAVDT17 and MS COCO datasets.
Effectively detects both small and large objects simultaneously.
Facilitates easy training with deeper or lighter backbones.
Abstract
This paper proposes the Parallel Residual Bi-Fusion Feature Pyramid Network (PRB-FPN) for fast and accurate single-shot object detection. Feature Pyramid (FP) is widely used in recent visual detection, however the top-down pathway of FP cannot preserve accurate localization due to pooling shifting. The advantage of FP is weakened as deeper backbones with more layers are used. In addition, it cannot keep up accurate detection of both small and large objects at the same time. To address these issues, we propose a new parallel FP structure with bi-directional (top-down and bottom-up) fusion and associated improvements to retain high-quality features for accurate localization. We provide the following design improvements: (1) A parallel bifusion FP structure with a bottom-up fusion module (BFM) to detect both small and large objects at once with high accuracy. (2) A concatenation and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
