Robust Collaborative 3D Object Detection in Presence of Pose Errors
Yifan Lu, Quanhao Li, Baoan Liu, Mehrdad Dianati, Chen Feng, Siheng, Chen, Yanfeng Wang

TL;DR
CoAlign is a robust hybrid framework for collaborative 3D object detection that effectively mitigates pose estimation errors without requiring ground-truth pose supervision, improving detection accuracy in practical scenarios.
Contribution
The paper introduces CoAlign, a novel agent-object pose graph and multi-scale data fusion approach that enhances robustness to pose errors in collaborative 3D detection.
Findings
Significantly reduces localization error under pose inaccuracies
Achieves state-of-the-art detection performance with pose errors
Does not require ground-truth pose supervision during training
Abstract
Collaborative 3D object detection exploits information exchange among multiple agents to enhance accuracy of object detection in presence of sensor impairments such as occlusion. However, in practice, pose estimation errors due to imperfect localization would cause spatial message misalignment and significantly reduce the performance of collaboration. To alleviate adverse impacts of pose errors, we propose CoAlign, a novel hybrid collaboration framework that is robust to unknown pose errors. The proposed solution relies on a novel agent-object pose graph modeling to enhance pose consistency among collaborating agents. Furthermore, we adopt a multi-scale data fusion strategy to aggregate intermediate features at multiple spatial resolutions. Comparing with previous works, which require ground-truth pose for training supervision, our proposed CoAlign is more practical since it doesn't…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Face recognition and analysis
