MVS-TTA: Test-Time Adaptation for Multi-View Stereo via Meta-Auxiliary Learning
Hannuo Zhang, Zhixiang Chi, Yang Wang, Xinxin Zuo

TL;DR
MVS-TTA introduces a novel test-time adaptation framework for multi-view stereo that combines meta-learning with self-supervised auxiliary tasks to improve model generalization across diverse datasets.
Contribution
It is the first to integrate optimization-based test-time adaptation into learning-based MVS using meta-learning, enhancing adaptability and performance.
Findings
Consistently improves MVS performance on standard datasets.
Effective in cross-dataset generalization scenarios.
Model-agnostic and easy to integrate with existing MVS methods.
Abstract
Recent learning-based multi-view stereo (MVS) methods are data-driven and have achieved remarkable progress due to large-scale training data and advanced architectures. However, their generalization remains sub-optimal due to fixed model parameters trained on limited training data distributions. In contrast, optimization-based methods enable scene-specific adaptation but lack scalability and require costly per-scene optimization. In this paper, we propose MVS-TTA, an efficient test-time adaptation (TTA) framework that enhances the adaptability of learning-based MVS methods by bridging these two paradigms. Specifically, MVS-TTA employs a self-supervised, cross-view consistency loss as an auxiliary task to guide inference-time adaptation. We introduce a meta-auxiliary learning strategy to train the model to benefit from auxiliary-task-based updates explicitly. Our framework is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Advanced Image Processing Techniques · Image Enhancement Techniques
