Auto-Annotation with Expert-Crafted Guidelines: A Study through 3D LiDAR Detection Benchmark
Yechi Ma, Wei Hua, Shu Kong

TL;DR
This paper introduces AutoExpert, a benchmark for auto-annotation of 3D LiDAR data using expert-crafted guidelines, leveraging foundation models to improve detection accuracy significantly.
Contribution
It presents a novel benchmark and method for auto-annotation in 3D LiDAR data using foundation models, addressing data-modality and annotation discrepancies.
Findings
Boosts 3D detection mAP to 25.4 from 12.1
Utilizes foundation models for 2D detection and segmentation
Provides a new benchmark with expert-crafted guidelines
Abstract
Data annotation is crucial for developing machine learning solutions. The current paradigm is to hire ordinary human annotators to annotate data instructed by expert-crafted guidelines. As this paradigm is laborious, tedious, and costly, we are motivated to explore auto-annotation with expert-crafted guidelines. To this end, we first develop a supporting benchmark AutoExpert by repurposing the established nuScenes dataset, which has been widely used in autonomous driving research and provides authentic expert-crafted guidelines. The guidelines define 18 object classes using both nuanced language descriptions and a few visual examples, and require annotating objects in LiDAR data with 3D cuboids. Notably, the guidelines do not provide LiDAR visuals to demonstrate how to annotate. Therefore, AutoExpert requires methods to learn on few-shot labeled images and texts to perform 3D detection…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Software Engineering Research
