MonoLSS: Learnable Sample Selection For Monocular 3D Detection
Zhenjia Li, Jinrang Jia, Yifeng Shi

TL;DR
MonoLSS introduces a learnable sample selection module and a data augmentation method for monocular 3D detection, significantly improving accuracy and stability in autonomous driving applications.
Contribution
The paper proposes the LSS module based on Gumbel-Softmax for adaptive sample selection and MixUp3D for data augmentation, enhancing monocular 3D detection performance.
Findings
Achieved 1st place on KITTI 3D detection benchmark in all categories.
Demonstrated synergistic improvements when combining LSS and MixUp3D.
Outperformed previous methods on Waymo and KITTI-nuScenes datasets.
Abstract
In the field of autonomous driving, monocular 3D detection is a critical task which estimates 3D properties (depth, dimension, and orientation) of objects in a single RGB image. Previous works have used features in a heuristic way to learn 3D properties, without considering that inappropriate features could have adverse effects. In this paper, sample selection is introduced that only suitable samples should be trained to regress the 3D properties. To select samples adaptively, we propose a Learnable Sample Selection (LSS) module, which is based on Gumbel-Softmax and a relative-distance sample divider. The LSS module works under a warm-up strategy leading to an improvement in training stability. Additionally, since the LSS module dedicated to 3D property sample selection relies on object-level features, we further develop a data augmentation method named MixUp3D to enrich 3D property…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Machine Learning and Data Classification
