BEVUDA: Multi-geometric Space Alignments for Domain Adaptive BEV 3D Object Detection
Jiaming Liu, Rongyu Zhang, Xiaoqi Li, Xiaowei Chi, Zehui Chen, Ming, Lu, Yandong Guo, Shanghang Zhang

TL;DR
This paper introduces a novel multi-space alignment framework for domain adaptation in BEV 3D object detection, effectively reducing domain gaps across multiple geometric representations and improving transfer performance in autonomous driving scenarios.
Contribution
The paper proposes a Multi-space Alignment Teacher-Student framework with a depth-aware teacher and a geometric-space aligned student to address domain shift in BEV perception.
Findings
Achieves state-of-the-art results on three cross-domain BEV detection benchmarks.
Effectively reduces domain gap across multiple geometric feature spaces.
Demonstrates robustness in diverse real-world domain transfer scenarios.
Abstract
Vision-centric bird-eye-view (BEV) perception has shown promising potential in autonomous driving. Recent works mainly focus on improving efficiency or accuracy but neglect the challenges when facing environment changing, resulting in severe degradation of transfer performance. For BEV perception, we figure out the significant domain gaps existing in typical real-world cross-domain scenarios and comprehensively solve the Domain Adaption (DA) problem for multi-view 3D object detection. Since BEV perception approaches are complicated and contain several components, the domain shift accumulation on multiple geometric spaces (i.e., 2D, 3D Voxel, BEV) makes BEV DA even challenging. In this paper, we propose a Multi-space Alignment Teacher-Student (MATS) framework to ease the domain shift accumulation, which consists of a Depth-Aware Teacher (DAT) and a Geometric-space Aligned Student (GAS)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
MethodsALIGN
