Orthogonal Jacobian Regularization for Unsupervised Disentanglement in Image Generation
Yuxiang Wei, Yupeng Shi, Xiao Liu, Zhilong Ji, Yuan Gao, Zhongqin Wu,, Wangmeng Zuo

TL;DR
This paper introduces Orthogonal Jacobian Regularization (OroJaR), a novel method that encourages disentangled representations in deep generative models by promoting orthogonality in output variations caused by different latent dimensions, improving controllability in image generation.
Contribution
OroJaR is a simple, effective regularization technique that indirectly enforces a diagonal Hessian, enhancing disentanglement in spatially correlated variations within generative models.
Findings
OroJaR outperforms state-of-the-art methods in disentanglement metrics.
The method improves controllability in image generation tasks.
OroJaR effectively disentangles spatially correlated latent factors.
Abstract
Unsupervised disentanglement learning is a crucial issue for understanding and exploiting deep generative models. Recently, SeFa tries to find latent disentangled directions by performing SVD on the first projection of a pre-trained GAN. However, it is only applied to the first layer and works in a post-processing way. Hessian Penalty minimizes the off-diagonal entries of the output's Hessian matrix to facilitate disentanglement, and can be applied to multi-layers.However, it constrains each entry of output independently, making it not sufficient in disentangling the latent directions (e.g., shape, size, rotation, etc.) of spatially correlated variations. In this paper, we propose a simple Orthogonal Jacobian Regularization (OroJaR) to encourage deep generative model to learn disentangled representations. It simply encourages the variation of output caused by perturbations on different…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Image Processing and 3D Reconstruction · Digital Media Forensic Detection
