Empirical Study of Multi-Task Hourglass Model for Semantic Segmentation Task

Darwin Saire; Adín Ramírez Rivera

arXiv:2105.13531·cs.CV·May 31, 2021

TL;DR

This paper studies a multi-task hourglass CNN that learns semantic segmentation jointly with edge detection, semantic contour, and distance transform tasks, improving spatial accuracy and robustness on benchmark datasets without post-processing.

Contribution

The paper proposes a multi-task learning framework that complements semantic segmentation with edge detection, semantic contour, and distance transform tasks to improve the spatial precision of CNN models.

Findings

01 Improved segmentation accuracy on the Cityscapes, CamVid, and Freiburg Forest datasets.

02 The multi-task approach outperforms single-task models without post-processing.

03 A shared latent space enhances feature robustness and spatial detail.

Abstract

The semantic segmentation (SS) task aims to create a dense classification by labeling each object present in an image at the pixel level. Convolutional neural network (CNN) approaches have been widely used and have exhibited the best results in this task. However, the loss of spatial precision in the results is a major drawback that has not been solved. In this work, we propose a multi-task approach that complements the semantic segmentation task with edge detection, semantic contour, and distance transform tasks. We hypothesize that, by sharing a common latent space, the complementary tasks can produce more robust representations that enhance the semantic labels. We explore the influence of contour-based tasks on the latent space, as well as their impact on the final SS results. We demonstrate the effectiveness of learning in a multi-task setting for hourglass models in the Cityscapes,…
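
As a rough illustration of the complementary tasks named in the abstract, the sketch below derives an edge target and a distance-transform target from a segmentation label map, and combines per-task losses with a weighted sum. It is not taken from the paper or its repository: the function names (`edge_map`, `distance_to_edge`, `multitask_loss`), the 4-neighbour edge rule, the city-block distance metric, and the weighted-sum loss are all assumptions chosen to keep the example small and dependency-free.

```python
from collections import deque

# 4-connected neighbourhood offsets (an assumption; the paper may use another rule).
NEIGHBOURS = ((1, 0), (-1, 0), (0, 1), (0, -1))


def edge_map(labels):
    """Mark pixels that have at least one 4-neighbour with a different class label."""
    h, w = len(labels), len(labels[0])
    edges = [[0] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            for dy, dx in NEIGHBOURS:
                ny, nx = y + dy, x + dx
                if 0 <= ny < h and 0 <= nx < w and labels[ny][nx] != labels[y][x]:
                    edges[y][x] = 1
                    break
    return edges


def distance_to_edge(edges):
    """City-block distance from each pixel to the nearest edge pixel.

    Uses a multi-source breadth-first search. Pixels stay None if the
    map contains no edge pixel at all.
    """
    h, w = len(edges), len(edges[0])
    dist = [[None] * w for _ in range(h)]
    queue = deque()
    for y in range(h):
        for x in range(w):
            if edges[y][x]:
                dist[y][x] = 0
                queue.append((y, x))
    while queue:
        y, x = queue.popleft()
        for dy, dx in NEIGHBOURS:
            ny, nx = y + dy, x + dx
            if 0 <= ny < h and 0 <= nx < w and dist[ny][nx] is None:
                dist[ny][nx] = dist[y][x] + 1
                queue.append((ny, nx))
    return dist


def multitask_loss(losses, weights):
    """Weighted sum of per-task losses (hypothetical weighting scheme)."""
    return sum(weights[name] * losses[name] for name in losses)


# Toy 3x4 label map with two classes split down the middle.
labels = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
]
edges = edge_map(labels)
print(edges)                    # [[0, 1, 1, 0], [0, 1, 1, 0], [0, 1, 1, 0]]
print(distance_to_edge(edges))  # [[1, 0, 0, 1], [1, 0, 0, 1], [1, 0, 0, 1]]
```

In a multi-task setting of this kind, each target would supervise its own output head on top of the shared latent representation, with the per-task losses combined into a single training objective; the weights passed to `multitask_loss` are placeholders, not values reported by the authors.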

Code & Models

Repositories

https://gitlab.com/mipl/mtl-ss
tf · Official