AutoGeoLabel: Automated Label Generation for Geospatial Machine Learning

Conrad M Albrecht; Fernando Marianno; Levente J Klein

arXiv:2202.00067·eess.IV·February 2, 2022

TL;DR

This paper presents AutoGeoLabel, a platform-independent method that automatically generates labels for geospatial data from rasterized statistical features of surveys such as LiDAR. The labels reach accuracies of about 0.9 and are intended to support supervised learning in remote sensing applications.

Contribution

It evaluates an automated label generation pipeline for geospatial data that reaches accuracies of about 0.9. The general method is platform independent and can be adapted to other satellite modalities.

Findings

01 Achieved accuracies of ~0.9 in multi-class label generation

02 Proposed a general method that is platform independent and adaptable to other satellite modalities

03 Generated labels, as a proof of concept on IBM PAIRS, in dense urban areas with multiple land cover classes

Abstract

A key challenge of supervised learning is the availability of human-labeled data. We evaluate a big data processing pipeline to auto-generate labels for remote sensing data. It is based on rasterized statistical features extracted from surveys such as LiDAR measurements. Using simple combinations of the rasterized statistical layers, we demonstrate that labels for multiple classes can be generated at accuracies of ~0.9. As a proof of concept, we use the big geo-data platform IBM PAIRS to dynamically generate such labels in dense urban areas with multiple land cover classes. The general method proposed here is platform independent, and it can be adapted to generate labels for other satellite modalities in order to enable machine learning on overhead imagery for land use classification and object detection.
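The idea of combining rasterized statistical layers into class labels can be illustrated with a minimal sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the layer names (height_mean, height_std, reflectance_mean), the threshold values, the three classes, and the toy 2x2 rasters. The sketch only shows the general pattern of thresholding per-pixel statistics with boolean masks and scoring the result against a reference map.

```python
import numpy as np


def auto_label(height_mean, height_std, reflectance_mean,
               min_height=3.0, rough_std=1.0, veg_reflectance=0.5):
    """Combine per-pixel statistics into a class map.

    All rules and thresholds are hypothetical placeholders.
    Classes: 0 = ground/other, 1 = building, 2 = vegetation.
    """
    labels = np.zeros(height_mean.shape, dtype=np.uint8)
    elevated = height_mean > min_height
    rough = height_std > rough_std
    # Elevated and smooth surface: treated as a building roof.
    labels[elevated & ~rough] = 1
    # Elevated, rough, and highly reflective: treated as tree canopy.
    labels[elevated & rough & (reflectance_mean > veg_reflectance)] = 2
    return labels


def accuracy(predicted, reference):
    """Fraction of pixels whose label matches the reference map."""
    return float(np.mean(predicted == reference))


# Toy 2x2 rasters standing in for rasterized survey statistics.
height_mean = np.array([[0.2, 8.0], [12.0, 0.5]])
height_std = np.array([[0.1, 0.3], [2.5, 0.2]])
reflectance_mean = np.array([[0.2, 0.3], [0.8, 0.6]])

labels = auto_label(height_mean, height_std, reflectance_mean)

# Hypothetical reference labels, e.g. from a manually curated map.
reference = np.array([[0, 1], [2, 2]], dtype=np.uint8)
score = accuracy(labels, reference)
```

In practice the layers would come from a geospatial data platform or a raster file rather than hand-written arrays, and the thresholds would need to be tuned per survey and per class.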
