TL;DR
RoCo is an unsupervised iterative framework that improves collaborative perception in autonomous driving by matching objects and adjusting agent poses to mitigate the effects of pose errors, enhancing detection accuracy and robustness.
Contribution
First to model pose correction as an object matching task in collaborative perception, introducing an iterative graph optimization process for pose adjustment.
Findings
Outperforms existing methods in object detection accuracy.
Demonstrates robustness against high-level pose noise.
Effective on both simulated and real-world datasets.
Abstract
Collaborative autonomous driving with multiple vehicles usually requires the data fusion from multiple modalities. To ensure effective fusion, the data from each individual modality shall maintain a reasonably high quality. However, in collaborative perception, the quality of object detection based on a modality is highly sensitive to the relative pose errors among the agents. It leads to feature misalignment and significantly reduces collaborative performance. To address this issue, we propose RoCo, a novel unsupervised framework to conduct iterative object matching and agent pose adjustment. To the best of our knowledge, our work is the first to model the pose correction problem in collaborative perception as an object matching task, which reliably associates common objects detected by different agents. On top of this, we propose a graph optimization process to adjust the agent poses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
