Unsupervised Part Segmentation through Disentangling Appearance and   Shape

Shilong Liu; Lei Zhang; Xiao Yang; Hang Su; Jun Zhu

arXiv:2105.12405·cs.CV·May 27, 2021

Unsupervised Part Segmentation through Disentangling Appearance and Shape

Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu

PDF

Open Access

TL;DR

This paper introduces an unsupervised method for object part segmentation that disentangles appearance and shape without relying on annotated masks, improving interpretability and segmentation consistency across diverse objects.

Contribution

The authors propose a novel disentanglement approach using reconstruction losses and a bottleneck block to enhance unsupervised part segmentation without additional mask data.

Findings

01

Effective segmentation on faces, birds, and PASCAL VOC objects.

02

Improved semantic consistency of segmented parts.

03

Outperforms previous unsupervised methods in accuracy.

Abstract

We study the problem of unsupervised discovery and segmentation of object parts, which, as an intermediate local representation, are capable of finding intrinsic object structure and providing more explainable recognition results. Recent unsupervised methods have greatly relaxed the dependency on annotated data which are costly to obtain, but still rely on additional information such as object segmentation mask or saliency map. To remove such a dependency and further improve the part segmentation performance, we develop a novel approach by disentangling the appearance and shape representations of object parts followed with reconstruction losses without using additional object mask information. To avoid degenerated solutions, a bottleneck block is designed to squeeze and expand the appearance representation, leading to a more effective disentanglement between geometry and appearance.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection