ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using   2D Labels Only

Saad Lahlali; Nicolas Granger; Herv\'e Le Borgne; Quoc-Cuong Pham

arXiv:2407.17197·cs.CV·November 28, 2024

ALPI: Auto-Labeller with Proxy Injection for 3D Object Detection using 2D Labels Only

Saad Lahlali, Nicolas Granger, Herv\'e Le Borgne, Quoc-Cuong Pham

PDF

1 Repo

TL;DR

This paper introduces ALPI, a weakly supervised 3D object detection method that uses only 2D labels and size priors, employing proxy objects and pseudo-labeling to achieve competitive results.

Contribution

ALPI presents a novel approach for 3D detection using 2D annotations, proxy objects, and depth-invariant loss functions, reducing annotation costs.

Findings

01

Performs on-par or better than previous methods on KITTI Car detection.

02

Achieves near fully supervised performance on challenging classes.

03

Demonstrates robustness on the nuScenes dataset.

Abstract

3D object detection plays a crucial role in various applications such as autonomous vehicles, robotics and augmented reality. However, training 3D detectors requires a costly precise annotation, which is a hindrance to scaling annotation to large datasets. To address this challenge, we propose a weakly supervised 3D annotator that relies solely on 2D bounding box annotations from images, along with size priors. One major problem is that supervising a 3D detection model using only 2D boxes is not reliable due to ambiguities between different 3D poses and their identical 2D projection. We introduce a simple yet effective and generic solution: we build 3D proxy objects with annotations by construction and add them to the training dataset. Our method requires only size priors to adapt to new classes. To better align 2D supervision with 3D detection, our method ensures depth invariance with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

cea-list/alpi
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsALIGN