Homography Loss for Monocular 3D Object Detection
Jiaqi Gu, Bojian Wu, Lubin Fan, Jianqiang Huang, Shen Cao, Zhiyu, Xiang, Xian-Sheng Hua

TL;DR
This paper introduces Homography Loss, a novel differentiable loss function that leverages spatial relationships and 2D detections to improve monocular 3D object detection accuracy by globally constraining 3D box predictions.
Contribution
It proposes a universal Homography Loss that incorporates geometric relations among objects and 2D guidance, significantly enhancing existing monocular 3D detectors.
Findings
Achieves state-of-the-art performance on KITTI 3D dataset.
Effectively integrates 2D and 3D information for better box estimation.
Boosts baseline detector accuracy with a simple, universal loss function.
Abstract
Monocular 3D object detection is an essential task in autonomous driving. However, most current methods consider each 3D object in the scene as an independent training sample, while ignoring their inherent geometric relations, thus inevitably resulting in a lack of leveraging spatial constraints. In this paper, we propose a novel method that takes all the objects into consideration and explores their mutual relationships to help better estimate the 3D boxes. Moreover, since 2D detection is more reliable currently, we also investigate how to use the detected 2D boxes as guidance to globally constrain the optimization of the corresponding predicted 3D boxes. To this end, a differentiable loss function, termed as Homography Loss, is proposed to achieve the goal, which exploits both 2D and 3D information, aiming at balancing the positional relationships between different objects by global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Robotics and Sensor-Based Localization
