Learnability-Driven Submodular Optimization for Active Roadside 3D Detection
Ruiyu Mao, Baoming Zhang, Nicholas Ruozzi, Yunhui Guo

TL;DR
This paper introduces LH3D, an active learning framework for roadside monocular 3D detection that prioritizes learnability over uncertainty, reducing annotation effort while maintaining high detection performance.
Contribution
It proposes a learnability-driven scene selection method for active learning in roadside 3D detection, effectively suppressing ambiguous samples and improving annotation efficiency.
Findings
Achieves over 86% of full performance with only 25% annotation budget
Outperforms uncertainty-based active learning baselines
Demonstrates learnability is more critical than uncertainty in this context
Abstract
Roadside perception datasets are typically constructed via cooperative labeling between synchronized vehicle and roadside frame pairs. However, real deployment often requires annotation of roadside-only data due to hardware and privacy constraints. Even human experts struggle to produce accurate labels without vehicle-side data (image, LIDAR), which not only increases annotation difficulty and cost, but also reveals a fundamental learnability problem: many roadside-only scenes contain distant, blurred, or occluded objects whose 3D properties are ambiguous from a single view and can only be reliably annotated by cross-checking paired vehicle--roadside frames. We refer to such cases as inherently ambiguous samples. To reduce wasted annotation effort on inherently ambiguous samples while still obtaining high-performing models, we turn to active learning. This work focuses on active…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Domain Adaptation and Few-Shot Learning
