Improving Spatial Codification in Semantic Segmentation

Carles Ventura; Xavier Gir\'o-i-Nieto; Ver\'onica Vilaplana; Kevin; McGuinness; Ferran Marqu\'es; Noel E. O'Connor

arXiv:1505.07409·cs.CV·November 15, 2016

Improving Spatial Codification in Semantic Segmentation

Carles Ventura, Xavier Gir\'o-i-Nieto, Ver\'onica Vilaplana, Kevin, McGuinness, Ferran Marqu\'es, Noel E. O'Connor

PDF

TL;DR

This paper introduces a new spatial codification method for semantic segmentation that partitions images into figure, border, and ground regions, and applies spatial pyramids to enhance object description, leading to improved segmentation results.

Contribution

It proposes a novel spatial partitioning and pyramid-based descriptor that reduces context influence and enhances object representation in semantic segmentation.

Findings

01

Improved segmentation accuracy on Pascal VOC datasets.

02

Enhanced object description with spatial pyramids.

03

Better Figure-Ground pooling performance.

Abstract

This paper explores novel approaches for improving the spatial codification for the pooling of local descriptors to solve the semantic segmentation problem. We propose to partition the image into three regions for each object to be described: Figure, Border and Ground. This partition aims at minimizing the influence of the image context on the object description and vice versa by introducing an intermediate zone around the object contour. Furthermore, we also propose a richer visual descriptor of the object by applying a Spatial Pyramid over the Figure region. Two novel Spatial Pyramid configurations are explored: Cartesian-based and crown-based Spatial Pyramids. We test these approaches with state-of-the-art techniques and show that they improve the Figure-Ground based pooling in the Pascal VOC 2011 and 2012 semantic segmentation challenges.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.