Robust Disentanglement of a Few Factors at a Time
Benjamin Estermann, Markus Marks, Mehmet Fatih Yanik

TL;DR
This paper presents a recursive training method for variational autoencoders that improves unsupervised disentanglement of data factors by iteratively removing learned factors and retraining, leading to state-of-the-art results.
Contribution
It introduces the recursive rPU-VAE approach and the use of Unsupervised Disentanglement Ranking to enhance and stabilize unsupervised disentanglement.
Findings
Significant improvement in disentanglement performance.
Enhanced robustness across datasets.
Effective iterative removal of learned factors.
Abstract
Disentanglement is at the forefront of unsupervised learning, as disentangled representations of data improve generalization, interpretability, and performance in downstream tasks. Current unsupervised approaches remain inapplicable for real-world datasets since they are highly variable in their performance and fail to reach levels of disentanglement of (semi-)supervised approaches. We introduce population-based training (PBT) for improving consistency in training variational autoencoders (VAEs) and demonstrate the validity of this approach in a supervised setting (PBT-VAE). We then use Unsupervised Disentanglement Ranking (UDR) as an unsupervised heuristic to score models in our PBT-VAE training and show how models trained this way tend to consistently disentangle only a subset of the generative factors. Building on top of this observation we introduce the recursive rPU-VAE approach.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Anomaly Detection Techniques and Applications
