LAPTNet-FPN: Multi-scale LiDAR-aided Projective Transform Network for Real Time Semantic Grid Prediction
Manuel Alejandro Diaz-Zapata (CHROMA), David Sierra Gonz\'alez, (CHROMA), \"Ozg\"ur Erkent (CHROMA), Jilles Dibangoye (CHROMA), Christian, Laugier (CHROMA, E-MOTION, Inria)

TL;DR
This paper introduces LAPTNet-FPN, a multi-scale LiDAR-aided network that improves real-time semantic grid prediction by fusing sensor data, achieving significant accuracy gains and 25 FPS performance.
Contribution
The novel multi-scale LiDAR-aided perspective transform network enhances semantic grid prediction accuracy and efficiency in autonomous systems.
Findings
8.67% improvement in human class accuracy
49.07% improvement in movable object class accuracy
25 FPS inference speed
Abstract
Semantic grids can be useful representations of the scene around an autonomous system. By having information about the layout of the space around itself, a robot can leverage this type of representation for crucial tasks such as navigation or tracking. By fusing information from multiple sensors, robustness can be increased and the computational load for the task can be lowered, achieving real time performance. Our multi-scale LiDAR-Aided Perspective Transform network uses information available in point clouds to guide the projection of image features to a top-view representation, resulting in a relative improvement in the state of the art for semantic grid generation for human (+8.67%) and movable object (+49.07%) classes in the nuScenes dataset, as well as achieving results close to the state of the art for the vehicle, drivable area and walkway classes, while performing inference at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Neural Network Applications · 3D Shape Modeling and Analysis
