Depth Not Needed - An Evaluation of RGB-D Feature Encodings for Off-Road   Scene Understanding by Convolutional Neural Network

Christopher J. Holder; Toby P. Breckon; Xiong Wei

arXiv:1801.01235·cs.CV·January 8, 2018

Depth Not Needed - An Evaluation of RGB-D Feature Encodings for Off-Road Scene Understanding by Convolutional Neural Network

Christopher J. Holder, Toby P. Breckon, Xiong Wei

PDF

Open Access

TL;DR

This study evaluates the effectiveness of various RGB-D feature encodings for off-road scene understanding using CNNs, adapting urban scene models to challenging unstructured environments and comparing multiple depth encoding methods.

Contribution

It introduces a systematic comparison of depth encoding techniques for CNN-based off-road scene segmentation, extending urban scene models to unstructured off-road environments.

Findings

01

HHA encoding improves segmentation accuracy over disparity.

02

Normal orientation encoding enhances feature representation.

03

RGB-D input outperforms RGB-only input in off-road scene understanding.

Abstract

Scene understanding for autonomous vehicles is a challenging computer vision task, with recent advances in convolutional neural networks (CNNs) achieving results that notably surpass prior traditional feature driven approaches. However, limited work investigates the application of such methods either within the highly unstructured off-road environment or to RGBD input data. In this work, we take an existing CNN architecture designed to perform semantic segmentation of RGB images of urban road scenes, then adapt and retrain it to perform the same task with multichannel RGBD images obtained under a range of challenging off-road conditions. We compare two different stereo matching algorithms and five different methods of encoding depth information, including disparity, local normal orientation and HHA (horizontal disparity, height above ground plane, angle with gravity), to create a total…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · Robotics and Sensor-Based Localization