TL;DR
This paper investigates a multi-task hourglass CNN model that jointly learns semantic segmentation, edge detection, and related tasks, improving spatial accuracy and robustness without post-processing on benchmark datasets.
Contribution
It introduces a multi-task learning framework combining semantic segmentation with edge and contour detection to enhance spatial precision in CNN models.
Findings
Improved segmentation accuracy on Cityscapes, CamVid, and Freiburg Forest datasets.
Multi-task approach outperforms single-task models without post-processing.
Shared latent space enhances feature robustness and spatial detail.
Abstract
The semantic segmentation (SS) task aims to create a dense classification by labeling at the pixel level each object present on images. Convolutional neural network (CNN) approaches have been widely used, and exhibited the best results in this task. However, the loss of spatial precision on the results is a main drawback that has not been solved. In this work, we propose to use a multi-task approach by complementing the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We propose that by sharing a common latent space, the complementary tasks can produce more robust representations that can enhance the semantic labels. We explore the influence of contour-based tasks on latent space, as well as their impact on the final results of SS. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
