Guided Generative Models using Weak Supervision for Detecting Object Spatial Arrangement in Overhead Images
Weiwei Duan, Yao-Yi Chiang, Stefan Leyk, Johannes H. Uhl, Craig A., Knoblock

TL;DR
This paper introduces TGGM, a weakly supervised generative model based on VAE and GMM, that efficiently estimates spatial arrangements of objects in overhead images with minimal manual annotations.
Contribution
The paper proposes a novel TGGM model that updates GMM components individually within the VAE framework, reducing annotation needs and capturing semantic spatial relationships.
Findings
Achieves comparable results to semi-supervised methods.
Outperforms unsupervised methods by 10% in F1 score.
Requires significantly fewer labeled data.
Abstract
The increasing availability and accessibility of numerous overhead images allows us to estimate and assess the spatial arrangement of groups of geospatial target objects, which can benefit many applications, such as traffic monitoring and agricultural monitoring. Spatial arrangement estimation is the process of identifying the areas which contain the desired objects in overhead images. Traditional supervised object detection approaches can estimate accurate spatial arrangement but require large amounts of bounding box annotations. Recent semi-supervised clustering approaches can reduce manual labeling but still require annotations for all object categories in the image. This paper presents the target-guided generative model (TGGM), under the Variational Auto-encoder (VAE) framework, which uses Gaussian Mixture Models (GMM) to estimate the distributions of both hidden and decoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image and Video Retrieval Techniques · Automated Road and Building Extraction · Remote-Sensing Image Classification
