Enhancing Generalization of Depth Estimation Foundation Model via Weakly-Supervised Adaptation with Regularization
Yan Huang, Yongyi Su, Xin Lin, Le Zhang, Xun Xu

TL;DR
This paper introduces WeSTAR, a novel weakly-supervised adaptation framework that enhances the robustness and generalization of depth estimation foundation models across diverse and unseen domains.
Contribution
We propose a parameter-efficient adaptation method combining dense self-training, semantically-aware normalization, and weak supervision with regularization to improve depth estimation models.
Findings
WeSTAR outperforms existing methods on various out-of-distribution datasets.
The framework achieves state-of-the-art results across multiple benchmarks.
It effectively mitigates topological errors and maintains model stability.
Abstract
The emergence of foundation models has substantially advanced zero-shot generalization in monocular depth estimation (MDE), as exemplified by the Depth Anything series. However, given access to some data from downstream tasks, a natural question arises: can the performance of these models be further improved? To this end, we propose WeSTAR, a parameter-efficient framework that performs Weakly supervised Self-Training Adaptation with Regularization, designed to enhance the robustness of MDE foundation models in unseen and diverse domains. We first adopt a dense self-training objective as the primary source of structural self-supervision. To further improve robustness, we introduce semantically-aware hierarchical normalization, which exploits instance-level segmentation maps to perform more stable and multi-scale structural normalization. Beyond dense supervision, we introduce a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Vision and Imaging · Advanced Neural Network Applications
