Towards Long-Tailed 3D Detection
Neehar Peri, Achal Dave, Deva Ramanan, Shu Kong

TL;DR
This paper introduces Long-Tailed 3D Detection (LT3D), a framework that evaluates and improves detection of rare classes in autonomous vehicle data by leveraging hierarchical relationships and multimodal fusion, significantly boosting accuracy especially for tail classes.
Contribution
The paper formalizes LT3D, adapting existing 3D detection models for long-tailed class distributions, and proposes hierarchical losses and multimodal fusion techniques to enhance rare class detection.
Findings
Accuracy for all classes improved by 5% AP on average.
Significant boost in rare class AP, e.g., stroller AP from 3.6 to 31.6.
Hierarchical and multimodal methods are key to long-tailed 3D detection.
Abstract
Contemporary autonomous vehicle (AV) benchmarks have advanced techniques for training 3D detectors, particularly on large-scale lidar data. Surprisingly, although semantic class labels naturally follow a long-tailed distribution, contemporary benchmarks focus on only a few common classes (e.g., pedestrian and car) and neglect many rare classes in-the-tail (e.g., debris and stroller). However, AVs must still detect rare classes to ensure safe operation. Moreover, semantic classes are often organized within a hierarchy, e.g., tail classes such as child and construction-worker are arguably subclasses of pedestrian. However, such hierarchical relationships are often ignored, which may lead to misleading estimates of performance and missed opportunities for algorithmic innovation. We address these challenges by formally studying the problem of Long-Tailed 3D Detection (LT3D), which evaluates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Autonomous Vehicle Technology and Safety · Video Surveillance and Tracking Methods
