Auto-Annotation with Expert-Crafted Guidelines: A Study through 3D LiDAR Detection Benchmark

Yechi Ma; Wei Hua; Shu Kong

arXiv:2506.02914·cs.CV·March 20, 2026


TL;DR

This paper introduces AutoExpert, a benchmark for auto-annotating 3D LiDAR data from expert-crafted guidelines, along with a foundation-model-based method that raises 3D detection mAP from 12.1 to 25.4.

Contribution

It presents a novel benchmark and method for auto-annotation in 3D LiDAR data using foundation models, addressing data-modality and annotation discrepancies.

Findings

01 Boosts 3D detection mAP from 12.1 to 25.4

02 Utilizes foundation models for 2D detection and segmentation

03 Provides a new benchmark with expert-crafted guidelines

Abstract

Data annotation is crucial for developing machine learning solutions. The current paradigm is to hire ordinary human annotators who label data by following expert-crafted guidelines. As this paradigm is laborious, tedious, and costly, we are motivated to explore auto-annotation with expert-crafted guidelines. To this end, we first develop a supporting benchmark, AutoExpert, by repurposing the established nuScenes dataset, which has been widely used in autonomous driving research and provides authentic expert-crafted guidelines. The guidelines define 18 object classes using both nuanced language descriptions and a few visual examples, and require annotating objects in LiDAR data with 3D cuboids. Notably, the guidelines do not provide LiDAR visuals to demonstrate how to annotate. Therefore, AutoExpert requires methods to learn from few-shot labeled images and text to perform 3D detection…
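To make the gap between image-based guidance and LiDAR cuboid labels concrete, here is a minimal sketch of one common way to lift a 2D detection into 3D. It is not the paper's method, which the abstract above does not describe. It assumes LiDAR points already transformed into the camera frame, a pinhole camera with hypothetical intrinsics `fx`, `fy`, `cx`, `cy`, and a 2D box from some image detector; the function names are illustrative only.

```python
from typing import List, Optional, Tuple

Point = Tuple[float, float, float]          # (x, y, z) in the camera frame
Box2D = Tuple[float, float, float, float]   # (u_min, v_min, u_max, v_max) in pixels


def project(point: Point, fx: float, fy: float,
            cx: float, cy: float) -> Optional[Tuple[float, float]]:
    """Pinhole projection; returns None for points behind the camera."""
    x, y, z = point
    if z <= 0:
        return None
    return (fx * x / z + cx, fy * y / z + cy)


def lift_box_to_cuboid(points: List[Point], box2d: Box2D,
                       fx: float, fy: float, cx: float, cy: float):
    """Fit an axis-aligned cuboid to the points that project inside box2d.

    Returns (center, size) or None if no point falls inside the box.
    Assumption: no orientation estimate and no outlier filtering, both of
    which a real annotation pipeline would need.
    """
    u0, v0, u1, v1 = box2d
    inside = []
    for p in points:
        uv = project(p, fx, fy, cx, cy)
        if uv is not None and u0 <= uv[0] <= u1 and v0 <= uv[1] <= v1:
            inside.append(p)
    if not inside:
        return None
    mins = [min(p[i] for p in inside) for i in range(3)]
    maxs = [max(p[i] for p in inside) for i in range(3)]
    center = tuple((lo + hi) / 2 for lo, hi in zip(mins, maxs))
    size = tuple(hi - lo for lo, hi in zip(mins, maxs))
    return center, size
```

Background points that fall behind the object but inside the same 2D box would inflate such a cuboid, which is one reason a segmentation mask and depth-based filtering are usually preferred over a raw box.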

Taxonomy

Topics: Natural Language Processing Techniques · Semantic Web and Ontologies · Software Engineering Research