Complex Relations in a Deep Structured Prediction Model for Fine Image   Segmentation

Cristina Mata; Guy Ben-Yosef; Boris Katz

arXiv:1805.09462·cs.CV·May 25, 2018

Complex Relations in a Deep Structured Prediction Model for Fine Image Segmentation

Cristina Mata, Guy Ben-Yosef, Boris Katz

PDF

Open Access

TL;DR

This paper enhances deep structured prediction models for fine image segmentation by integrating higher-order relations inspired by human part identification, improving segmentation of complex object parts.

Contribution

It introduces higher-order potentials based on containment and attachment relations into CRF models for better fine-grained segmentation.

Findings

01

Improved segmentation of fine object parts.

02

Higher-order relations positively impact segmentation accuracy.

03

Complex structural features further enhance segmentation results.

Abstract

Many deep learning architectures for semantic segmentation involve a Fully Convolutional Neural Network (FCN) followed by a Conditional Random Field (CRF) to carry out inference over an image. These models typically involve unary potentials based on local appearance features computed by FCNs, and binary potentials based on the displacement between pixels. We show that while current methods succeed in segmenting whole objects, they perform poorly in situations involving a large number of object parts. We therefore suggest incorporating into the inference algorithm additional higher-order potentials inspired by the way humans identify and localize parts. We incorporate two relations that were shown to be useful to human object identification - containment and attachment - into the energy term of the CRF and evaluate their performance on the Pascal VOC Parts dataset. Our experimental…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning

MethodsConditional Random Field