Robust Visual SLAM for UAV Navigation in GPS-Denied and Degraded Environments: A Multi-Paradigm Evaluation and Deployment Study
Prasoon Kumar, Akshay Deepak, Sandeep Kumar

TL;DR
This study systematically compares five V-SLAM systems across classical, deep learning, recurrent, and ViT paradigms, evaluating their robustness in degraded environments for UAV navigation.
Contribution
It provides a comprehensive evaluation of diverse V-SLAM systems under various environmental degradations, highlighting their strengths and limitations for UAV deployment.
Findings
Learning-based methods outperform classical ones under severe degradation.
MASt3R achieves the lowest degraded ATE of 0.027 m.
DPVO offers the best efficiency robustness trade-off with 86.1% TSR.
Abstract
Reliable localization in GPS-denied, visually degraded environments is critical for autonomous UAV opera- tions. This paper presents a systematic comparative evaluation of five V-SLAM systems ORB-SLAM3, DPVO, DROID-SLAM, DUSt3R, and MASt3R spanning classical, deep learning, recurrent, and Vision Transformer (ViT) paradigms. Experiments are conducted on curated sequences from four public benchmarks (TUM RGB-D, EuRoC MAV, UMA-VI, SubT-MRS) and a custom monocular indoor dataset under five controlled degradation conditions (normal, low light, dust haze, motion blur, and combined), with sub-millimeter Vicon ground truth. Results show that ORB-SLAM3 fails critically under severe degradation (62.4% overall TSR; 0% under dense haze), while learning-based methods remain robust: MASt3R achieves the lowest degraded ATE (0.027 m) and DUSt3R the highest tracking success (96.5%). DPVO offers the best…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
