Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object Detection
Yilong Lv, Min Li, Yujie He, Shaopeng Li, Zhuzhen He, Aitao Yang

TL;DR
This paper introduces the Anchor-Intermediate Detector (AID) that decouples and couples bounding box corners to improve object detection accuracy, outperforming baseline models on MS COCO without additional bells and whistles.
Contribution
The paper proposes the Box Decouple-Couple (BDC) strategy and a novel AID model with dual heads, enhancing bounding box prediction accuracy in object detection.
Findings
AID outperforms RetinaNet and GFL by 2.4 and 1.2 AP on MS COCO.
BDC strategy effectively decouples and couples bounding box corners.
The corner-aware head improves corner scoring for better detection.
Abstract
Anchor-based detectors have been continuously developed for object detection. However, the individual anchor box makes it difficult to predict the boundary's offset accurately. Instead of taking each bounding box as a closed individual, we consider using multiple boxes together to get prediction boxes. To this end, this paper proposes the \textbf{Box Decouple-Couple(BDC) strategy} in the inference, which no longer discards the overlapping boxes, but decouples the corner points of these boxes. Then, according to each corner's score, we couple the corner points to select the most accurate corner pairs. To meet the BDC strategy, a simple but novel model is designed named the \textbf{Anchor-Intermediate Detector(AID)}, which contains two head networks, i.e., an anchor-based head and an anchor-free \textbf{Corner-aware head}. The corner-aware head is able to score the corners of each…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
Methods1x1 Convolution · Focal Loss · Convolution · Feature Pyramid Network · RetinaNet
