VectorSynth: Fine-Grained Satellite Image Synthesis with Structured Semantics
Daniel Cher, Brian Wei, Srikumar Sastry, Nathan Jacobs

TL;DR
VectorSynth is a diffusion-based framework for satellite image synthesis that uses polygonal geographic annotations and semantic attributes for pixel-accurate, spatially grounded image editing and generation.
Contribution
It introduces a novel cross-modal correspondence learning approach that aligns imagery with semantic vector geometry for fine-grained, spatially aware satellite image synthesis.
Findings
Outperforms prior methods in semantic fidelity and realism
Supports interactive spatial edits and map-informed content generation
Demonstrates effective fine-grained spatial grounding with a vision language model
Abstract
We introduce VectorSynth, a diffusion-based framework for pixel-accurate satellite image synthesis conditioned on polygonal geographic annotations with semantic attributes. Unlike prior text- or layout-conditioned models, VectorSynth learns dense cross-modal correspondences that align imagery and semantic vector geometry, enabling fine-grained, spatially grounded edits. A vision language alignment module produces pixel-level embeddings from polygon semantics; these embeddings guide a conditional image generation framework to respect both spatial extents and semantic cues. VectorSynth supports interactive workflows that mix language prompts with geometry-aware conditioning, allowing rapid what-if simulations, spatial edits, and map-informed content generation. For training and evaluation, we assemble a collection of satellite scenes paired with pixel-registered polygon annotations…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Geographic Information Systems Studies · Robotics and Sensor-Based Localization
