Joint Semantic Segmentation and Depth Estimation with Deep Convolutional Networks
Arsalan Mousavian, Hamed Pirsiavash, Jana Kosecka

TL;DR
This paper introduces a unified deep CNN model that jointly performs semantic segmentation and depth estimation from a single RGB image, leveraging multi-task learning and CRF for improved accuracy.
Contribution
It presents a novel joint training framework for simultaneous depth and semantic tasks, combining CNNs with CRF to enhance contextual understanding.
Findings
Outperforms state-of-the-art on semantic segmentation
Achieves comparable results on depth estimation
Demonstrates effective multi-task learning with a single loss function
Abstract
Multi-scale deep CNNs have been used successfully for problems mapping each pixel to a label, such as depth estimation and semantic segmentation. It has also been shown that such architectures are reusable and can be used for multiple tasks. These networks are typically trained independently for each task by varying the output layer(s) and training objective. In this work we present a new model for simultaneous depth estimation and semantic segmentation from a single RGB image. Our approach demonstrates the feasibility of training parts of the model for each task and then fine tuning the full, combined model on both tasks simultaneously using a single loss function. Furthermore we couple the deep CNN with fully connected CRF, which captures the contextual relationships and interactions between the semantic and depth cues improving the accuracy of the final results. The proposed model is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Processing Techniques and Applications · Advanced Vision and Imaging · Image and Object Detection Techniques
MethodsConditional Random Field
