Multi-Modality Driven LoRA for Adverse Condition Depth Estimation
Guanglei Yang, Rui Tian, Yongqiang Zhang, Zhun Zhong, Yongqiang Li,, Wangmeng Zuo

TL;DR
This paper introduces MMD-LoRA, a multi-modality approach combining prompt-driven domain alignment and contrastive learning to improve adverse condition depth estimation, achieving state-of-the-art results efficiently.
Contribution
It proposes a novel multi-modality framework with low-rank adaptation for domain transfer and multimodal feature alignment in adverse weather depth estimation.
Findings
Achieves state-of-the-art performance on nuScenes and Oxford RobotCar datasets.
Demonstrates robustness and efficiency in adverse environmental conditions.
Effectively aligns multimodal features for better depth estimation under adverse weather.
Abstract
The autonomous driving community is increasingly focused on addressing corner case problems, particularly those related to ensuring driving safety under adverse conditions (e.g., nighttime, fog, rain). To this end, the task of Adverse Condition Depth Estimation (ACDE) has gained significant attention. Previous approaches in ACDE have primarily relied on generative models, which necessitate additional target images to convert the sunny condition into adverse weather, or learnable parameters for feature augmentation to adapt domain gaps, resulting in increased model complexity and tuning efforts. Furthermore, unlike CLIP-based methods where textual and visual features have been pre-aligned, depth estimation models lack sufficient alignment between multimodal features, hindering coherent understanding under adverse conditions. To address these limitations, we propose Multi-Modality Driven…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFault Detection and Control Systems · Anomaly Detection Techniques and Applications · Industrial Vision Systems and Defect Detection
MethodsDiffusion · Contrastive Language-Image Pre-training
