Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for   Accurate Object Detection

Yilong Lv; Min Li; Yujie He; Shaopeng Li; Zhuzhen He; Aitao Yang

arXiv:2310.05666·cs.CV·October 10, 2023·2 cites

Anchor-Intermediate Detector: Decoupling and Coupling Bounding Boxes for Accurate Object Detection

Yilong Lv, Min Li, Yujie He, Shaopeng Li, Zhuzhen He, Aitao Yang

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Anchor-Intermediate Detector (AID) that decouples and couples bounding box corners to improve object detection accuracy, outperforming baseline models on MS COCO without additional bells and whistles.

Contribution

The paper proposes the Box Decouple-Couple (BDC) strategy and a novel AID model with dual heads, enhancing bounding box prediction accuracy in object detection.

Findings

01

AID outperforms RetinaNet and GFL by 2.4 and 1.2 AP on MS COCO.

02

BDC strategy effectively decouples and couples bounding box corners.

03

The corner-aware head improves corner scoring for better detection.

Abstract

Anchor-based detectors have been continuously developed for object detection. However, the individual anchor box makes it difficult to predict the boundary's offset accurately. Instead of taking each bounding box as a closed individual, we consider using multiple boxes together to get prediction boxes. To this end, this paper proposes the \textbf{Box Decouple-Couple(BDC) strategy} in the inference, which no longer discards the overlapping boxes, but decouples the corner points of these boxes. Then, according to each corner's score, we couple the corner points to select the most accurate corner pairs. To meet the BDC strategy, a simple but novel model is designed named the \textbf{Anchor-Intermediate Detector(AID)}, which contains two head networks, i.e., an anchor-based head and an anchor-free \textbf{Corner-aware head}. The corner-aware head is able to score the corners of each…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yilonglv/aid
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection

Methods1x1 Convolution · Focal Loss · Convolution · Feature Pyramid Network · RetinaNet