X-PDNet: Accurate Joint Plane Instance Segmentation and Monocular Depth Estimation with Cross-Task Distillation and Boundary Correction
Cao Dinh Duc, Jongwoo Lim

TL;DR
X-PDNet is a novel multitask learning framework that enhances plane instance segmentation and depth estimation from single RGB images by incorporating cross-task distillation and boundary-aware depth-guided boundary refinement.
Contribution
The paper introduces cross-task feature distillation and a depth-based boundary regression loss to improve joint plane segmentation and depth estimation.
Findings
Outperforms baseline methods with significant improvements.
Effectively improves boundary region segmentation accuracy.
Demonstrates robustness on ScanNet and Stanford 2D-3D-S datasets.
Abstract
Segmentation of planar regions from a single RGB image is a particularly important task in the perception of complex scenes. To utilize both visual and geometric properties in images, recent approaches often formulate the problem as a joint estimation of planar instances and dense depth through feature fusion mechanisms and geometric constraint losses. Despite promising results, these methods do not consider cross-task feature distillation and perform poorly in boundary regions. To overcome these limitations, we propose X-PDNet, a framework for the multitask learning of plane instance segmentation and depth estimation with improvements in the following two aspects. Firstly, we construct the cross-task distillation design which promotes early information sharing between dual-tasks for specific task improvements. Secondly, we highlight the current limitations of using the ground truth…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Image and Object Detection Techniques
