Guided Generative Models using Weak Supervision for Detecting Object Spatial Arrangement in Overhead Images

Weiwei Duan; Yao-Yi Chiang; Stefan Leyk; Johannes H. Uhl; Craig A. Knoblock

arXiv:2112.05786·cs.CV·December 14, 2021

TL;DR

This paper introduces TGGM, a weakly supervised generative model built on a VAE and GMMs that efficiently estimates the spatial arrangement of objects in overhead images with minimal manual annotation.

Contribution

The paper proposes TGGM, a model that updates GMM components individually within the VAE framework, reducing annotation needs and capturing semantic spatial relationships.

Findings

01. Achieves results comparable to semi-supervised methods.

02. Outperforms unsupervised methods by 10% in F1 score.

03. Requires significantly less labeled data.
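For readers unfamiliar with the metric behind the second finding, the F1 score is the harmonic mean of precision and recall. The following is a minimal sketch of that standard definition only; the function name `f1_score` and the patch counts are hypothetical and are not taken from the paper's experiments.

```python
def f1_score(tp: int, fp: int, fn: int) -> float:
    """Return F1, the harmonic mean of precision and recall.

    tp, fp, fn are counts of true positives, false positives,
    and false negatives (e.g. image patches flagged as containing
    the target object).
    """
    if tp == 0:
        # No correct detections: precision and recall are both zero.
        return 0.0
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


# Hypothetical counts, for illustration only (not results from the paper).
score = f1_score(tp=80, fp=20, fn=20)
```

The summary above does not say whether the reported 10% gap is in absolute percentage points or relative terms, so the sketch makes no assumption about it.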

Abstract

The increasing availability and accessibility of numerous overhead images allow us to estimate and assess the spatial arrangement of groups of geospatial target objects, which can benefit many applications, such as traffic monitoring and agricultural monitoring. Spatial arrangement estimation is the process of identifying the areas that contain the desired objects in overhead images. Traditional supervised object detection approaches can estimate the spatial arrangement accurately but require large amounts of bounding box annotations. Recent semi-supervised clustering approaches can reduce manual labeling but still require annotations for all object categories in the image. This paper presents the target-guided generative model (TGGM), under the Variational Auto-encoder (VAE) framework, which uses Gaussian Mixture Models (GMM) to estimate the distributions of both hidden and decoder…
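To make the role of a Gaussian mixture over latent codes more concrete, here is a generic sketch of how mixture responsibilities can separate "target" from "non-target" image patches. It is not the TGGM algorithm: the two-component setup, the 2-D latent codes, the function names, and all parameter values are assumptions chosen purely for illustration, and in a real VAE the codes would come from a trained encoder.

```python
import numpy as np


def log_gaussian_diag(z, mean, var):
    """Log density of a diagonal-covariance Gaussian at each row of z."""
    return -0.5 * np.sum(np.log(2 * np.pi * var) + (z - mean) ** 2 / var, axis=1)


def responsibilities(z, weights, means, variances):
    """Posterior probability of each mixture component for each latent code."""
    log_p = np.stack(
        [np.log(w) + log_gaussian_diag(z, m, v)
         for w, m, v in zip(weights, means, variances)],
        axis=1,
    )
    log_p -= log_p.max(axis=1, keepdims=True)  # stabilise before exponentiating
    p = np.exp(log_p)
    return p / p.sum(axis=1, keepdims=True)


# Hypothetical 2-D latent codes for four image patches (stand-ins for encoder output).
z = np.array([[0.1, -0.2], [2.9, 3.1], [0.0, 0.3], [3.2, 2.8]])
weights = np.array([0.5, 0.5])
means = np.array([[0.0, 0.0], [3.0, 3.0]])  # component 1 plays the "target" role (assumed)
variances = np.ones((2, 2))

r = responsibilities(z, weights, means, variances)
target_mask = r[:, 1] > 0.5  # patches assigned to the assumed target component
```

In this toy setup the patches whose codes lie near the second component's mean are flagged as containing the target, which mirrors the idea of spatial arrangement estimation as identifying the image areas that contain the desired objects.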

Taxonomy

Topics: Advanced Image and Video Retrieval Techniques · Automated Road and Building Extraction · Remote-Sensing Image Classification