Revisiting Factorizing Aggregated Posterior in Learning Disentangled Representations
Ze Cheng, Juncheng Li, Chenxu Wang, Jixuan Gu, Hao Xu, Xinjian Li,, Florian Metze

TL;DR
This paper reveals a theoretical gap in current disentangled representation learning methods that penalize total correlation, showing that low sampled total correlation does not ensure low mean total correlation, and proposes a solution to address this issue.
Contribution
It provides a theoretical analysis of the disparity between sampled and mean representations in disentanglement, and introduces a method to reduce mean total correlation.
Findings
Low total correlation of samples does not guarantee low total correlation of means.
The proposed method effectively reduces mean total correlation, leading to more factorized representations.
The paper discusses limitations of current approaches, highlighting the issue of factor disintegration.
Abstract
In the problem of learning disentangled representations, one of the promising methods is to factorize aggregated posterior by penalizing the total correlation of sampled latent variables. However, this well-motivated strategy has a blind spot: there is a disparity between the sampled latent representation and its corresponding mean representation. In this paper, we provide a theoretical explanation that low total correlation of sampled representation cannot guarantee low total correlation of the mean representation. Indeed, we prove that for the multivariate normal distributions, the mean representation with arbitrarily high total correlation can have a corresponding sampled representation with bounded total correlation. We also propose a method to eliminate this disparity. Experiments show that our model can learn a mean representation with much lower total correlation, hence a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Digital Media Forensic Detection
