Unsupervised Part Segmentation through Disentangling Appearance and Shape
Shilong Liu, Lei Zhang, Xiao Yang, Hang Su, Jun Zhu

TL;DR
This paper introduces an unsupervised method for object part segmentation that disentangles appearance and shape without relying on annotated masks, improving interpretability and segmentation consistency across diverse objects.
Contribution
The authors propose a novel disentanglement approach using reconstruction losses and a bottleneck block to enhance unsupervised part segmentation without additional mask data.
Findings
Effective segmentation on faces, birds, and PASCAL VOC objects.
Improved semantic consistency of segmented parts.
Outperforms previous unsupervised methods in accuracy.
Abstract
We study the problem of unsupervised discovery and segmentation of object parts, which, as an intermediate local representation, are capable of finding intrinsic object structure and providing more explainable recognition results. Recent unsupervised methods have greatly relaxed the dependency on annotated data which are costly to obtain, but still rely on additional information such as object segmentation mask or saliency map. To remove such a dependency and further improve the part segmentation performance, we develop a novel approach by disentangling the appearance and shape representations of object parts followed with reconstruction losses without using additional object mask information. To avoid degenerated solutions, a bottleneck block is designed to squeeze and expand the appearance representation, leading to a more effective disentanglement between geometry and appearance.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFace recognition and analysis · Advanced Image and Video Retrieval Techniques · Visual Attention and Saliency Detection
