Test-Time Adaptation for Height Completion via Self-Supervised ViT Features and Monocular Foundation Models
Osher Rafaeli, Tal Svoray, Ariel Nahlieli

TL;DR
Prior2DSM is a test-time, training-free framework that enhances digital surface model completion by leveraging foundation models and self-supervised features, outperforming traditional and supervised methods.
Contribution
It introduces a novel test-time adaptation approach combining foundation models with lightweight modules for metric DSM completion without task-specific training.
Findings
Achieves up to 46% reduction in RMSE over linear fitting of monocular depth estimates.
Demonstrates consistent improvements over interpolation and prior-based rescaling methods.
Enables DSM updating and coupled RGB-DSM generation.
Abstract
Accurate digital surface models (DSMs) are essential for many geospatial applications, including urban monitoring, environmental analyses, infrastructure management, and change detection. However, large-scale DSMs frequently contain incomplete or outdated regions due to acquisition limitations, reconstruction artifacts, or changes in the built environment. Traditional height completion approaches primarily rely on spatial interpolation or which assume spatial continuity and therefore fail when objects are missing. Recent learning-based approaches improve reconstruction quality but typically require supervised training on sensor-specific datasets, limiting their generalization across domains and sensing conditions. We propose Prior2DSM, a training-free framework for metric DSM completion that operates entirely at test time by leveraging foundation models. Unlike previous height…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
