Geodiffussr: Generative Terrain Texturing with Elevation Fidelity

Tai Inui; Alexander Matsumura; Edgar Simo-Serra

arXiv:2511.23029·cs.GR·December 1, 2025

Geodiffussr: Generative Terrain Texturing with Elevation Fidelity

Tai Inui, Alexander Matsumura, Edgar Simo-Serra

PDF

Open Access

TL;DR

Geodiffussr is a novel flow-matching pipeline that synthesizes terrain textures guided by text and strictly adheres to elevation data, significantly improving visual fidelity and elevation consistency in large-scale terrain generation.

Contribution

We introduce Geodiffussr, a new method combining multi-scale content aggregation with text-guided synthesis for terrain texturing aligned with elevation maps.

Findings

01

MCA improves visual fidelity over baseline

02

Height-appearance coupling is significantly enhanced

03

The method provides a controllable approach for terrain generation

Abstract

Large-scale terrain generation remains a labor-intensive task in computer graphics. We introduce Geodiffussr, a flow-matching pipeline that synthesizes text-guided texture maps while strictly adhering to a supplied Digital Elevation Map (DEM). The core mechanism is multi-scale content aggregation (MCA): DEM features from a pretrained encoder are injected into UNet blocks at multiple resolutions to enforce global-to-local elevation consistency. Compared with a non-MCA baseline, MCA markedly improves visual fidelity and strengthens height-appearance coupling (FID $↓$ 49.16%, LPIPS $↓$ 32.33%, $Δ$ dCor $↓$ to 0.0016). To train and evaluate Geodiffussr, we assemble a globally distributed, biome- and climate-stratified corpus of triplets pairing SRTM-derived DEMs with Sentinel-2 imagery and vision-grounded natural-language captions that describe visible land…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis