SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for   Aerial Semantic Segmentation

Aysim Toker; Marvin Eisenberger; Daniel Cremers; Laura Leal-Taix\'e

arXiv:2403.16605·cs.CV·March 26, 2024·1 cites

SatSynth: Augmenting Image-Mask Pairs through Diffusion Models for Aerial Semantic Segmentation

Aysim Toker, Marvin Eisenberger, Daniel Cremers, Laura Leal-Taix\'e

PDF

Open Access

TL;DR

This paper introduces SatSynth, a diffusion model-based method for generating paired satellite images and masks to augment training data, significantly improving semantic segmentation performance in earth observation tasks.

Contribution

First to generate high-quality, diverse image-mask pairs for satellite segmentation using diffusion models, enhancing data augmentation strategies.

Findings

01

Generated pairs show high quality and diversity.

02

Augmentation with generated data improves segmentation accuracy.

03

Outperforms prior generative methods like GANs in this context.

Abstract

In recent years, semantic segmentation has become a pivotal tool in processing and interpreting satellite imagery. Yet, a prevalent limitation of supervised learning techniques remains the need for extensive manual annotations by experts. In this work, we explore the potential of generative image diffusion to address the scarcity of annotated data in earth observation tasks. The main idea is to learn the joint data manifold of images and labels, leveraging recent advancements in denoising diffusion probabilistic models. To the best of our knowledge, we are the first to generate both images and corresponding masks for satellite segmentation. We find that the obtained pairs not only display high quality in fine-scale features but also ensure a wide sampling diversity. Both aspects are crucial for earth observation data, where semantic classes can vary severely in scale and occurrence…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Multimodal Machine Learning Applications · Advanced Image and Video Retrieval Techniques

MethodsDiffusion