Geodiffussr: Generative Terrain Texturing with Elevation Fidelity
Tai Inui, Alexander Matsumura, Edgar Simo-Serra

TL;DR
Geodiffussr is a novel flow-matching pipeline that synthesizes terrain textures guided by text and strictly adheres to elevation data, significantly improving visual fidelity and elevation consistency in large-scale terrain generation.
Contribution
We introduce Geodiffussr, a new method combining multi-scale content aggregation with text-guided synthesis for terrain texturing aligned with elevation maps.
Findings
MCA improves visual fidelity over baseline
Height-appearance coupling is significantly enhanced
The method provides a controllable approach for terrain generation
Abstract
Large-scale terrain generation remains a labor-intensive task in computer graphics. We introduce Geodiffussr, a flow-matching pipeline that synthesizes text-guided texture maps while strictly adhering to a supplied Digital Elevation Map (DEM). The core mechanism is multi-scale content aggregation (MCA): DEM features from a pretrained encoder are injected into UNet blocks at multiple resolutions to enforce global-to-local elevation consistency. Compared with a non-MCA baseline, MCA markedly improves visual fidelity and strengthens height-appearance coupling (FID 49.16%, LPIPS 32.33%, dCor to 0.0016). To train and evaluate Geodiffussr, we assemble a globally distributed, biome- and climate-stratified corpus of triplets pairing SRTM-derived DEMs with Sentinel-2 imagery and vision-grounded natural-language captions that describe visible land…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputer Graphics and Visualization Techniques · Generative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis
