StreamLTS: Query-based Temporal-Spatial LiDAR Fusion for Cooperative Object Detection
Yunshuang Yuan, Monika Sester

TL;DR
This paper introduces TA-COOD, a novel framework for cooperative object detection that accounts for asynchronous LiDAR sensor timings, improving accuracy by modeling temporal object information with query-based techniques.
Contribution
It proposes a fully sparse, query-based framework that considers asynchronous sensor data, addressing a key gap in existing cooperative perception methods.
Findings
Superior efficiency over dense models.
Point-wise timestamps are crucial for accuracy.
Effective modeling of temporal object context.
Abstract
Cooperative perception via communication among intelligent traffic agents has great potential to improve the safety of autonomous driving. However, limited communication bandwidth, localization errors and asynchronized capturing time of sensor data, all introduce difficulties to the data fusion of different agents. To some extend, previous works have attempted to reduce the shared data size, mitigate the spatial feature misalignment caused by localization errors and communication delay. However, none of them have considered the asynchronized sensor ticking times, which can lead to dynamic object misplacement of more than one meter during data fusion. In this work, we propose Time-Aligned COoperative Object Detection (TA-COOD), for which we adapt widely used dataset OPV2V and DairV2X with considering asynchronous LiDAR sensor ticking times and build an efficient fully sparse framework…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Video Surveillance and Tracking Methods · Remote Sensing and LiDAR Applications
