ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic   Segmentation

Jinming Cao; Hanchao Leng; Dani Lischinski; Danny Cohen-Or; Changhe; Tu; Yangyan Li

arXiv:2108.10528·cs.CV·August 25, 2021·1 cites

ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation

Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe, Tu, Yangyan Li

PDF

Open Access 1 Repo

TL;DR

This paper introduces ShapeConv, a shape-aware convolutional layer that decomposes depth features into shape and base components, enhancing indoor RGB-D semantic segmentation accuracy without increasing inference complexity.

Contribution

The paper proposes a novel ShapeConv layer that explicitly models shape and base components of depth features, improving segmentation performance and seamlessly integrating into existing CNNs.

Findings

01

Significant improvement on NYU-Dv2, SUN RGB-D, and SID benchmarks.

02

Enhanced segmentation accuracy without additional inference cost.

03

Model-agnostic approach compatible with various CNN architectures.

Abstract

RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as well as the base (whereabout) of it in a larger context. Compared with the base, the shape probably is more inherent and has a stronger connection to the semantics, and thus is more critical for segmentation accuracy. Inspired by this observation, we introduce a Shape-aware Convolutional layer (ShapeConv) for processing the depth feature, where the depth feature is firstly decomposed into a shape-component and a base-component, next two learnable weights are introduced to cooperate with them…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hanchaoleng/shapeconv
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Video Surveillance and Tracking Methods

MethodsShapeConv · Convolution