Depth Not Needed - An Evaluation of RGB-D Feature Encodings for Off-Road Scene Understanding by Convolutional Neural Network
Christopher J. Holder, Toby P. Breckon, Xiong Wei

TL;DR
This study evaluates the effectiveness of various RGB-D feature encodings for off-road scene understanding using CNNs, adapting urban scene models to challenging unstructured environments and comparing multiple depth encoding methods.
Contribution
It introduces a systematic comparison of depth encoding techniques for CNN-based off-road scene segmentation, extending urban scene models to unstructured off-road environments.
Findings
HHA encoding improves segmentation accuracy over disparity.
Normal orientation encoding enhances feature representation.
RGB-D input outperforms RGB-only input in off-road scene understanding.
Abstract
Scene understanding for autonomous vehicles is a challenging computer vision task, with recent advances in convolutional neural networks (CNNs) achieving results that notably surpass prior traditional feature driven approaches. However, limited work investigates the application of such methods either within the highly unstructured off-road environment or to RGBD input data. In this work, we take an existing CNN architecture designed to perform semantic segmentation of RGB images of urban road scenes, then adapt and retrain it to perform the same task with multichannel RGBD images obtained under a range of challenging off-road conditions. We compare two different stereo matching algorithms and five different methods of encoding depth information, including disparity, local normal orientation and HHA (horizontal disparity, height above ground plane, angle with gravity), to create a total…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Vision and Imaging · Robotics and Sensor-Based Localization
