Li-ViP3D++: Query-Gated Deformable Camera-LiDAR Fusion for End-to-End Perception and Trajectory Prediction
Matej Halinkovic, Nina Masarykova, Alexey Vinel, Marek Galinski

TL;DR
Li-ViP3D++ introduces a query-based, fully differentiable camera-LiDAR fusion framework that enhances end-to-end perception and trajectory prediction for autonomous driving, improving accuracy and robustness while reducing false positives.
Contribution
It proposes a novel Query-Gated Deformable Fusion (QGDF) method for integrating multi-view RGB and LiDAR data in query space, enabling joint optimization of detection, tracking, and trajectory forecasting.
Findings
Improves nuScenes detection mAP to 0.502
Reduces false positive ratio to 0.147
Faster inference time (139.82 ms) than previous models
Abstract
End-to-end perception and trajectory prediction from raw sensor data is one of the key capabilities for autonomous driving. Modular pipelines restrict information flow and can amplify upstream errors. Recent query-based, fully differentiable perception-and-prediction (PnP) models mitigate these issues, yet the complementarity of cameras and LiDAR in the query-space has not been sufficiently explored. Models often rely on fusion schemes that introduce heuristic alignment and discrete selection steps which prevent full utilization of available information and can introduce unwanted bias. We propose Li-ViP3D++, a query-based multimodal PnP framework that introduces Query-Gated Deformable Fusion (QGDF) to integrate multi-view RGB and LiDAR in query space. QGDF (i) aggregates image evidence via masked attention across cameras and feature levels, (ii) extracts LiDAR context through fully…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutonomous Vehicle Technology and Safety · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
