TL;DR
This paper introduces a novel dynamic visual SLAM method tailored for outdoor construction sites with large moving objects, utilizing hierarchical masking and motion-state classification to improve robustness and efficiency.
Contribution
It presents a new motion segmentation approach combining semantic and geometric cues, enabling effective static scene extraction for robust SLAM in dynamic environments.
Findings
Outperforms standard visual SLAM in dynamic scenes
Achieves real-time processing on construction site datasets
Maintains accurate camera tracking despite large dynamic occlusions
Abstract
At modern construction sites, utilizing GNSS (Global Navigation Satellite System) to measure the real-time location and orientation (i.e. pose) of construction machines and navigate them is very common. However, GNSS is not always available. Replacing GNSS with on-board cameras and visual simultaneous localization and mapping (visual SLAM) to navigate the machines is a cost-effective solution. Nevertheless, at construction sites, multiple construction machines will usually work together and side-by-side, causing large dynamic occlusions in the cameras' view. Standard visual SLAM cannot handle large dynamic occlusions well. In this work, we propose a motion segmentation method to efficiently extract static parts from crowded dynamic scenes to enable robust tracking of camera ego-motion. Our method utilizes semantic information combined with object-level geometric constraints to quickly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
