CenterNet: Keypoint Triplets for Object Detection
Kaiwen Duan, Song Bai, Lingxi Xie, Honggang Qi, Qingming Huang, Qi, Tian

TL;DR
CenterNet introduces a novel keypoint triplet approach for object detection, enhancing accuracy and speed by detecting objects as triplets of keypoints and employing specialized pooling modules, outperforming existing detectors on MS-COCO.
Contribution
The paper proposes CenterNet, a new object detection framework that models objects as triplets of keypoints, with novel pooling modules, improving detection precision and recall over prior keypoint-based methods.
Findings
Achieves 47.0% AP on MS-COCO, outperforming existing one-stage detectors.
Provides faster inference speed with comparable performance to two-stage detectors.
Introduces cascade corner pooling and center pooling modules for better feature representation.
Abstract
In object detection, keypoint-based approaches often suffer a large number of incorrect object bounding boxes, arguably due to the lack of an additional look into the cropped regions. This paper presents an efficient solution which explores the visual patterns within each cropped region with minimal costs. We build our framework upon a representative one-stage keypoint-based detector named CornerNet. Our approach, named CenterNet, detects each object as a triplet, rather than a pair, of keypoints, which improves both precision and recall. Accordingly, we design two customized modules named cascade corner pooling and center pooling, which play the roles of enriching information collected by both top-left and bottom-right corners and providing more recognizable information at the central regions, respectively. On the MS-COCO dataset, CenterNet achieves an AP of 47.0%, which outperforms…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsDeep Layer Aggregation · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Max Pooling · Residual Connection · Convolution · Hourglass Module · Corner Pooling · Batch Normalization · CornerNet
