GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis

Srikumar Sastry; Subash Khanal; Aayush Dhakal; Nathan Jacobs

arXiv:2404.06637·cs.CV·April 11, 2024·1 cites

GeoSynth: Contextually-Aware High-Resolution Satellite Image Synthesis

Srikumar Sastry, Subash Khanal, Aayush Dhakal, Nathan Jacobs

PDF

Open Access 1 Repo 2 Models

TL;DR

GeoSynth is a novel model that synthesizes high-resolution satellite images with controllable style and layout, using textual prompts and geographic data, enabling diverse and region-specific image generation.

Contribution

It introduces a new approach combining textual and geographic controls for satellite image synthesis, trained on a large, multi-source dataset for high-quality, diverse outputs.

Findings

01

Effective global style control via text and location

02

High-quality, diverse satellite image generation

03

Strong zero-shot generalization capabilities

Abstract

We present GeoSynth, a model for synthesizing satellite images with global style and image-driven layout control. The global style control is via textual prompts or geographic location. These enable the specification of scene semantics or regional appearance respectively, and can be used together. We train our model on a large dataset of paired satellite imagery, with automatically generated captions, and OpenStreetMap data. We evaluate various combinations of control inputs, including different types of layout controls. Results demonstrate that our model can generate diverse, high-quality images and exhibits excellent zero-shot generalization. The code and model checkpoints are available at https://github.com/mvrl/GeoSynth.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mvrl/geosynth
pytorchOfficial

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · Image Retrieval and Classification Techniques · Advanced Vision and Imaging