You Only Label Once: 3D Box Adaptation from Point Cloud to Image via Semi-Supervised Learning
Jieqi Shi, Peiliang Li, Xiaozhi Chen, Shaojie Shen

TL;DR
This paper introduces a semi-supervised learning method that automatically adapts 3D Lidar bounding boxes to fit image appearances, reducing annotation effort for 3D object detection in multi-camera systems.
Contribution
It proposes a novel 3D box adaptation approach that refines Lidar boxes for image projection using minimal 2D annotations, enabling efficient and accurate cuboid annotation.
Findings
Achieves human-level cuboid annotation accuracy on images.
Reduces labeling effort by adapting 3D boxes with minimal supervision.
Demonstrates effectiveness on Waymo and NuScenes datasets.
Abstract
The image-based 3D object detection task expects that the predicted 3D bounding box has a ``tightness'' projection (also referred to as cuboid), which fits the object contour well on the image while still keeping the geometric attribute on the 3D space, e.g., physical dimension, pairwise orthogonal, etc. These requirements bring significant challenges to the annotation. Simply projecting the Lidar-labeled 3D boxes to the image leads to non-trivial misalignment, while directly drawing a cuboid on the image cannot access the original 3D information. In this work, we propose a learning-based 3D box adaptation approach that automatically adjusts minimum parameters of the 360 Lidar 3D bounding box to perfectly fit the image appearance of panoramic cameras. With only a few 2D boxes annotation as guidance during the training phase, our network can produce accurate image-level cuboid…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · 3D Surveying and Cultural Heritage
