ShapeConv: Shape-aware Convolutional Layer for Indoor RGB-D Semantic Segmentation
Jinming Cao, Hanchao Leng, Dani Lischinski, Danny Cohen-Or, Changhe, Tu, Yangyan Li

TL;DR
This paper introduces ShapeConv, a shape-aware convolutional layer that decomposes depth features into shape and base components, enhancing indoor RGB-D semantic segmentation accuracy without increasing inference complexity.
Contribution
The paper proposes a novel ShapeConv layer that explicitly models shape and base components of depth features, improving segmentation performance and seamlessly integrating into existing CNNs.
Findings
Significant improvement on NYU-Dv2, SUN RGB-D, and SID benchmarks.
Enhanced segmentation accuracy without additional inference cost.
Model-agnostic approach compatible with various CNN architectures.
Abstract
RGB-D semantic segmentation has attracted increasing attention over the past few years. Existing methods mostly employ homogeneous convolution operators to consume the RGB and depth features, ignoring their intrinsic differences. In fact, the RGB values capture the photometric appearance properties in the projected image space, while the depth feature encodes both the shape of a local geometry as well as the base (whereabout) of it in a larger context. Compared with the base, the shape probably is more inherent and has a stronger connection to the semantics, and thus is more critical for segmentation accuracy. Inspired by this observation, we introduce a Shape-aware Convolutional layer (ShapeConv) for processing the depth feature, where the depth feature is firstly decomposed into a shape-component and a base-component, next two learnable weights are introduced to cooperate with them…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Industrial Vision Systems and Defect Detection · Video Surveillance and Tracking Methods
MethodsShapeConv · Convolution
