IDMS: Instance Depth for Multi-scale Monocular 3D Object Detection
Chao Hu, Liqiang Zhu, Weibing Qiu, Weijie Wu

TL;DR
This paper introduces IDMS, a novel multi-scale perception module and auxiliary depth learning approach that significantly improves monocular 3D object detection accuracy, especially for different object scales.
Contribution
The paper proposes a multi-scale perception module based on dilated convolution and uses instance depth as an auxiliary task to enhance 3D perception in monocular detection.
Findings
Achieved 5.27% improvement in AP40 for cars on KITTI dataset.
Enhanced detection performance across multiple object scales.
Validated effectiveness through experiments on KITTI test and evaluation sets.
Abstract
Due to the lack of depth information of images and poor detection accuracy in monocular 3D object detection, we proposed the instance depth for multi-scale monocular 3D object detection method. Firstly, to enhance the model's processing ability for different scale targets, a multi-scale perception module based on dilated convolution is designed, and the depth features containing multi-scale information are re-refined from both spatial and channel directions considering the inconsistency between feature maps of different scales. Firstly, we designed a multi-scale perception module based on dilated convolution to enhance the model's processing ability for different scale targets. The depth features containing multi-scale information are re-refined from spatial and channel directions considering the inconsistency between feature maps of different scales. Secondly, so as to make the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Visual Attention and Saliency Detection
MethodsTest · Convolution · Dilated Convolution
