Fully Test-Time Adaptation for Monocular 3D Object Detection
Hongbin Lin, Yifan Zhang, Shuaicheng Niu, Shuguang Cui, Zhen Li

TL;DR
This paper introduces MonoTTA, a novel test-time adaptation method for monocular 3D object detection that improves model robustness to distribution shifts without requiring labeled data, achieving significant performance gains on KITTI and nuScenes datasets.
Contribution
The paper proposes MonoTTA, a new test-time adaptation framework with reliability-driven and noise-guard strategies for monocular 3D detection under OOD conditions.
Findings
MonoTTA achieves approximately 190% performance gains on KITTI.
MonoTTA achieves approximately 198% performance gains on nuScenes.
The method effectively handles out-of-distribution test data without labeled supervision.
Abstract
Monocular 3D object detection (Mono 3Det) aims to identify 3D objects from a single RGB image. However, existing methods often assume training and test data follow the same distribution, which may not hold in real-world test scenarios. To address the out-of-distribution (OOD) problems, we explore a new adaptation paradigm for Mono 3Det, termed Fully Test-time Adaptation. It aims to adapt a well-trained model to unlabeled test data by handling potential data distribution shifts at test time without access to training data and test labels. However, applying this paradigm in Mono 3Det poses significant challenges due to OOD test data causing a remarkable decline in object detection scores. This decline conflicts with the pre-defined score thresholds of existing detection methods, leading to severe object omissions (i.e., rare positive detections and many false negatives). Consequently, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Medical Image Segmentation Techniques
