TwistSLAM++: Fusing multiple modalities for accurate dynamic semantic SLAM
Mathieu Gonzalez, Eric Marchand, Amine Kacete, J\'er\^ome Royan

TL;DR
TwistSLAM++ is a multimodal SLAM system that fuses stereo images and LiDAR data with semantic information to improve dynamic object tracking and pose estimation in real-world scenarios.
Contribution
It introduces a novel fusion of stereo images and LiDAR with semantic data for accurate dynamic SLAM, including object shape estimation and pose refinement.
Findings
Improved object tracking accuracy on classical benchmarks.
Effective fusion of stereo and LiDAR data enhances SLAM robustness.
Enhanced object pose and shape estimation in dynamic environments.
Abstract
Most classical SLAM systems rely on the static scene assumption, which limits their applicability in real world scenarios. Recent SLAM frameworks have been proposed to simultaneously track the camera and moving objects. However they are often unable to estimate the canonical pose of the objects and exhibit a low object tracking accuracy. To solve this problem we propose TwistSLAM++, a semantic, dynamic, SLAM system that fuses stereo images and LiDAR information. Using semantic information, we track potentially moving objects and associate them to 3D object detections in LiDAR scans to obtain their pose and size. Then, we perform registration on consecutive object scans to refine object pose estimation. Finally, object scans are used to estimate the shape of the object and constrain map points to lie on the estimated surface within the BA. We show on classical benchmarks that this fusion…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Robotic Path Planning Algorithms · 3D Surveying and Cultural Heritage
