IV-SLAM: Introspective Vision for Simultaneous Localization and Mapping
Sadegh Rabiee, Joydeep Biswas

TL;DR
IV-SLAM introduces a context-aware noise modeling approach for visual SLAM, improving robustness and reducing tracking failures in challenging conditions by guiding feature extraction and error estimation.
Contribution
The paper proposes a novel introspective vision framework that models reprojection error noise as context-dependent, enabling more robust feature selection and error handling in SLAM.
Findings
Reduces tracking error compared to traditional V-SLAM
Accurately predicts sources of error in input images
Increases mean distance between tracking failures by over 70%
Abstract
Existing solutions to visual simultaneous localization and mapping (V-SLAM) assume that errors in feature extraction and matching are independent and identically distributed (i.i.d), but this assumption is known to not be true -- features extracted from low-contrast regions of images exhibit wider error distributions than features from sharp corners. Furthermore, V-SLAM algorithms are prone to catastrophic tracking failures when sensed images include challenging conditions such as specular reflections, lens flare, or shadows of dynamic objects. To address such failures, previous work has focused on building more robust visual frontends, to filter out challenging features. In this paper, we present introspective vision for SLAM (IV-SLAM), a fundamentally different approach for addressing these challenges. IV-SLAM explicitly models the noise process of reprojection errors from visual…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRobotics and Sensor-Based Localization · Advanced Image and Video Retrieval Techniques · Indoor and Outdoor Localization Technologies
