Learning Geocentric Object Pose in Oblique Monocular Images

Gordon Christie; Rodrigo Rene Rai Munoz Abujder; Kevin Foster; Shea; Hagstrom; Gregory D. Hager; Myron Z. Brown

arXiv:2007.00729·cs.CV·July 3, 2020

Learning Geocentric Object Pose in Oblique Monocular Images

Gordon Christie, Rodrigo Rene Rai Munoz Abujder, Kevin Foster, Shea, Hagstrom, Gregory D. Hager, Myron Z. Brown

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a deep learning method to estimate geocentric object pose from oblique monocular images, improving localization and alignment in Earth observation tasks without relying on monocular depth.

Contribution

The paper presents a novel encoding of geocentric pose and a deep network trained with lidar supervision to estimate object height and orientation from oblique images.

Findings

01

Enhanced accuracy in object localization and image alignment.

02

Effective extension of semantic segmentation datasets for satellite imagery.

03

Public availability of data and code for reproducibility.

Abstract

An object's geocentric pose, defined as the height above ground and orientation with respect to gravity, is a powerful representation of real-world structure for object detection, segmentation, and localization tasks using RGBD images. For close-range vision tasks, height and orientation have been derived directly from stereo-computed depth and more recently from monocular depth predicted by deep networks. For long-range vision tasks such as Earth observation, depth cannot be reliably estimated with monocular images. Inspired by recent work in monocular height above ground prediction and optical flow prediction from static images, we develop an encoding of geocentric pose to address this challenge and train a deep network to compute the representation densely, supervised by publicly available airborne lidar. We exploit these attributes to rectify oblique images and remove observed…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

pubgeo/monocular-geocentric-pose
tfOfficial

Videos

Learning Geocentric Object Pose in Oblique Monocular Images· youtube

Taxonomy

TopicsRobotics and Sensor-Based Localization · Advanced Vision and Imaging · Advanced Neural Network Applications