Segment, Lift and Fit: Automatic 3D Shape Labeling from 2D Prompts
Jianhao Li, Tianyu Sun, Zhongdao Wang, Enze Xie, Bailan Feng, Hongbo, Zhang, Ze Yuan, Ke Xu, Jiaheng Liu, and Ping Luo

TL;DR
This paper introduces an automatic 3D shape labeling algorithm from 2D prompts that generalizes well across datasets and produces high-quality annotations without dataset-specific training.
Contribution
The proposed SLF paradigm predicts 3D shapes from 2D masks without dataset training, improving generalization and annotation quality in autonomous driving applications.
Findings
Achieves nearly 90% [email protected] IoU on KITTI dataset.
Detectors trained with SLF-generated labels perform close to those trained with ground truth.
Provides detailed 3D shape predictions for dynamic objects.
Abstract
This paper proposes an algorithm for automatically labeling 3D objects from 2D point or box prompts, especially focusing on applications in autonomous driving. Unlike previous arts, our auto-labeler predicts 3D shapes instead of bounding boxes and does not require training on a specific dataset. We propose a Segment, Lift, and Fit (SLF) paradigm to achieve this goal. Firstly, we segment high-quality instance masks from the prompts using the Segment Anything Model (SAM) and transform the remaining problem into predicting 3D shapes from given 2D masks. Due to the ill-posed nature of this problem, it presents a significant challenge as multiple 3D shapes can project into an identical mask. To tackle this issue, we then lift 2D masks to 3D forms and employ gradient descent to adjust their poses and shapes until the projections fit the masks and the surfaces conform to surrounding LiDAR…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topics3D Shape Modeling and Analysis · Advanced Numerical Analysis Techniques · Image Processing and 3D Reconstruction
MethodsSparse Evolutionary Training
