Where and What? Examining Interpretable Disentangled Representations
Xinqi Zhu, Chang Xu, Dacheng Tao

TL;DR
This paper explores interpretability in disentangled representations by localizing effects of latent codes and enforcing simple variation capture, leading to improved unsupervised disentanglement without supervision.
Contribution
It introduces a novel approach combining spatial masks and perturbation-based constraints to enhance interpretability in unsupervised disentanglement learning.
Findings
Models learn high-quality disentangled representations
Spatial masks effectively localize latent effects
Unsupervised model selection improves disentanglement quality
Abstract
Capturing interpretable variations has long been one of the goals in disentanglement learning. However, unlike the independence assumption, interpretability has rarely been exploited to encourage disentanglement in the unsupervised setting. In this paper, we examine the interpretability of disentangled representations by investigating two questions: where to be interpreted and what to be interpreted? A latent code is easily to be interpreted if it would consistently impact a certain subarea of the resulting generated image. We thus propose to learn a spatial mask to localize the effect of each individual latent dimension. On the other hand, interpretability usually comes from latent dimensions that capture simple and basic variations in data. We thus impose a perturbation on a certain dimension of the latent code, and expect to identify the perturbation along this dimension from the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications
