Object-aware Monocular Depth Prediction with Instance Convolutions
Enis Simsar, Evin P{\i}nar \"Ornek, Fabian Manhardt, Helisa Dhamo,, Nassir Navab, Federico Tombari

TL;DR
This paper introduces Instance Convolution, a novel operator for monocular depth prediction that improves boundary accuracy by focusing on object parts via superpixels, outperforming classical convolutions especially at object edges.
Contribution
The paper proposes Instance Convolution, a new convolutional operator that explicitly avoids feature mixing across object boundaries using superpixels, enhancing depth estimation accuracy at discontinuities.
Findings
Superior boundary depth estimation on NYUv2 and iBims datasets
Comparable performance to classical convolution elsewhere
Effective handling of occlusion boundaries
Abstract
With the advent of deep learning, estimating depth from a single RGB image has recently received a lot of attention, being capable of empowering many different applications ranging from path planning for robotics to computational cinematography. Nevertheless, while the depth maps are in their entirety fairly reliable, the estimates around object discontinuities are still far from satisfactory. This can be contributed to the fact that the convolutional operator naturally aggregates features across object discontinuities, resulting in smooth transitions rather than clear boundaries. Therefore, in order to circumvent this issue, we propose a novel convolutional operator which is explicitly tailored to avoid feature aggregation of different object parts. In particular, our method is based on estimating per-part depth values by means of superpixels. The proposed convolutional operator, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Optical measurement and interference techniques
MethodsConvolution
