Lite-FPN for Keypoint-based Monocular 3D Object Detection
Lei Yang, Xinyu Zhang, Li Wang, Minghan Zhu, Jun Li

TL;DR
This paper introduces Lite-FPN, a lightweight feature pyramid network that enhances multi-scale detection in monocular 3D object detection, along with an attention loss to improve localization accuracy, resulting in better speed and precision.
Contribution
The paper proposes Lite-FPN for efficient multi-scale feature fusion and a novel attention loss to align confidence with localization quality in keypoint-based monocular 3D detection.
Findings
Significant accuracy improvements on KITTI dataset
Enhanced detection speed with Lite-FPN
Better alignment of confidence and localization with attention loss
Abstract
3D object detection with a single image is an essential and challenging task for autonomous driving. Recently, keypoint-based monocular 3D object detection has made tremendous progress and achieved great speed-accuracy trade-off. However, there still exists a huge gap with LIDAR-based methods in terms of accuracy. To improve their performance without sacrificing efficiency, we propose a sort of lightweight feature pyramid network called Lite-FPN to achieve multi-scale feature fusion in an effective and efficient way, which can boost the multi-scale detection capability of keypoint-based detectors. Besides, the misalignment between classification score and localization precision is further relieved by introducing a novel regression loss named attention loss. With the proposed loss, predictions with high confidence but poor localization are treated with more attention during the training…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Robotics and Sensor-Based Localization
