TL;DR
This paper introduces ALPI, a weakly supervised 3D object detection method that uses only 2D labels and size priors, employing proxy objects and pseudo-labeling to achieve competitive results.
Contribution
ALPI presents a novel approach for 3D detection using 2D annotations, proxy objects, and depth-invariant loss functions, reducing annotation costs.
Findings
Performs on-par or better than previous methods on KITTI Car detection.
Achieves near fully supervised performance on challenging classes.
Demonstrates robustness on the nuScenes dataset.
Abstract
3D object detection plays a crucial role in various applications such as autonomous vehicles, robotics and augmented reality. However, training 3D detectors requires a costly precise annotation, which is a hindrance to scaling annotation to large datasets. To address this challenge, we propose a weakly supervised 3D annotator that relies solely on 2D bounding box annotations from images, along with size priors. One major problem is that supervising a 3D detection model using only 2D boxes is not reliable due to ambiguities between different 3D poses and their identical 2D projection. We introduce a simple yet effective and generic solution: we build 3D proxy objects with annotations by construction and add them to the training dataset. Our method requires only size priors to adapt to new classes. To better align 2D supervision with 3D detection, our method ensures depth invariance with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsALIGN
