Seamless High-Resolution Terrain Reconstruction: A Prior-Based Vision Transformer Approach
Osher Rafaeli, Tal Svoray, Ariel Nahlieli

TL;DR
This paper introduces a novel prior-based vision transformer method for high-resolution terrain reconstruction from single-view imagery, significantly surpassing existing super-resolution techniques in resolution and accuracy.
Contribution
The work extends a monocular depth model to remote sensing, integrating low-res SRTM data with high-res NAIP imagery to produce globally consistent DEMs at 30 cm resolution.
Findings
Achieves 100x resolution enhancement over SRTM data.
Demonstrates less than 5 m mean absolute error compared to LiDAR.
Improves hydrological modeling accuracy for hazard assessment.
Abstract
High-resolution elevation data is essential for hydrological modeling, hazard assessment, and environmental monitoring; however, globally consistent, fine-scale Digital Elevation Models (DEMs) remain unavailable. Very high-resolution single-view imagery enables the extraction of topographic information at the pixel level, allowing the reconstruction of fine terrain details over large spatial extents. In this paper, we present single-view-based DEM reconstruction shown to support practical analysis in GIS environments across multiple sub-national jurisdictions. Specifically, we produce high-resolution DEMs for large-scale basins, representing a substantial improvement over the 30 m resolution of globally available Shuttle Radar Topography Mission (SRTM) data. The DEMs are generated using a prior-based monocular depth foundation (MDE) model, extended in this work to the remote sensing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
