Joint Object and Part Segmentation using Deep Learned Potentials

Peng Wang; Xiaohui Shen; Zhe Lin; Scott Cohen; Brian Price; Alan; Yuille

arXiv:1505.00276·cs.CV·May 5, 2015·27 cites

Joint Object and Part Segmentation using Deep Learned Potentials

Peng Wang, Xiaohui Shen, Zhe Lin, Scott Cohen, Brian Price, Alan, Yuille

PDF

Open Access

TL;DR

This paper presents a joint object and part segmentation method that leverages deep learned potentials and a fully connected CRF to improve accuracy by utilizing object-level context and detailed part localization.

Contribution

It introduces semantic compositional parts (SCP) and a two-channel FCN for joint segmentation, enhancing performance over existing methods.

Findings

01

Outperforms state-of-the-art on three datasets

02

Mutually improves object and part segmentation accuracy

03

Effectively utilizes long-range context for detailed parsing

Abstract

Segmenting semantic objects from images and parsing them into their respective semantic parts are fundamental steps towards detailed object understanding in computer vision. In this paper, we propose a joint solution that tackles semantic object and part segmentation simultaneously, in which higher object-level context is provided to guide part segmentation, and more detailed part-level localization is utilized to refine object segmentation. Specifically, we first introduce the concept of semantic compositional parts (SCP) in which similar semantic parts are grouped and shared among different objects. A two-channel fully convolutional network (FCN) is then trained to provide the SCP and object potentials at each pixel. At the same time, a compact set of segments can also be obtained from the SCP predictions of the network. Given the potentials and the generated segments, in order to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Advanced Image and Video Retrieval Techniques · Multimodal Machine Learning Applications