Seamless High-Resolution Terrain Reconstruction: A Prior-Based Vision Transformer Approach

Osher Rafaeli; Tal Svoray; Ariel Nahlieli

arXiv:2507.09681·cs.CV·April 3, 2026

Seamless High-Resolution Terrain Reconstruction: A Prior-Based Vision Transformer Approach

Osher Rafaeli, Tal Svoray, Ariel Nahlieli

PDF

TL;DR

This paper introduces a novel prior-based vision transformer method for high-resolution terrain reconstruction from single-view imagery, significantly surpassing existing super-resolution techniques in resolution and accuracy.

Contribution

The work extends a monocular depth model to remote sensing, integrating low-res SRTM data with high-res NAIP imagery to produce globally consistent DEMs at 30 cm resolution.

Findings

01

Achieves 100x resolution enhancement over SRTM data.

02

Demonstrates less than 5 m mean absolute error compared to LiDAR.

03

Improves hydrological modeling accuracy for hazard assessment.

Abstract

High-resolution elevation data is essential for hydrological modeling, hazard assessment, and environmental monitoring; however, globally consistent, fine-scale Digital Elevation Models (DEMs) remain unavailable. Very high-resolution single-view imagery enables the extraction of topographic information at the pixel level, allowing the reconstruction of fine terrain details over large spatial extents. In this paper, we present single-view-based DEM reconstruction shown to support practical analysis in GIS environments across multiple sub-national jurisdictions. Specifically, we produce high-resolution DEMs for large-scale basins, representing a substantial improvement over the 30 m resolution of globally available Shuttle Radar Topography Mission (SRTM) data. The DEMs are generated using a prior-based monocular depth foundation (MDE) model, extended in this work to the remote sensing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.