Complex Relations in a Deep Structured Prediction Model for Fine Image Segmentation
Cristina Mata, Guy Ben-Yosef, Boris Katz

TL;DR
This paper enhances deep structured prediction models for fine image segmentation by integrating higher-order relations inspired by human part identification, improving segmentation of complex object parts.
Contribution
It introduces higher-order potentials based on containment and attachment relations into CRF models for better fine-grained segmentation.
Findings
Improved segmentation of fine object parts.
Higher-order relations positively impact segmentation accuracy.
Complex structural features further enhance segmentation results.
Abstract
Many deep learning architectures for semantic segmentation involve a Fully Convolutional Neural Network (FCN) followed by a Conditional Random Field (CRF) to carry out inference over an image. These models typically involve unary potentials based on local appearance features computed by FCNs, and binary potentials based on the displacement between pixels. We show that while current methods succeed in segmenting whole objects, they perform poorly in situations involving a large number of object parts. We therefore suggest incorporating into the inference algorithm additional higher-order potentials inspired by the way humans identify and localize parts. We incorporate two relations that were shown to be useful to human object identification - containment and attachment - into the energy term of the CRF and evaluate their performance on the Pascal VOC Parts dataset. Our experimental…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Domain Adaptation and Few-Shot Learning
MethodsConditional Random Field
