Multiple Invertible and Partial-Equivariant Function for Latent Vector Transformation to Enhance Disentanglement in VAEs

Hee-Jun Jung; Jaehyoung Jeong; Kangil Kim

arXiv:2502.03740·cs.LG·February 6, 2026

Multiple Invertible and Partial-Equivariant Function for Latent Vector Transformation to Enhance Disentanglement in VAEs

Hee-Jun Jung, Jaehyoung Jeong, Kangil Kim

PDF

Open Access

TL;DR

This paper introduces MIPE-Transformation, a novel method that enhances disentanglement in VAEs by combining invertible, partial-equivariant transformations with exponential-family prior extension, leading to improved performance on multiple datasets.

Contribution

The paper proposes MIPE-Transformation, integrating invertible partial-equivariant mappings and exponential-family priors to improve disentanglement in VAEs, a novel combination not previously explored.

Findings

01

Improved disentanglement scores on 3D Cars, 3D Shapes, and dSprites datasets.

02

Demonstrates the effectiveness of invertible partial-equivariant transformations in VAEs.

03

Extends Gaussian prior to exponential family for better latent representation.

Abstract

Disentanglement learning is central to understanding and reusing learned representations in variational autoencoders (VAEs). Although equivariance has been explored in this context, effectively exploiting it for disentanglement remains challenging. In this paper, we propose a novel method, called Multiple Invertible and Partial-Equivariant Transformation (MIPE-Transformation), which integrates two main parts: (1) Invertible and Partial-Equivariant Transformation (IPE-Transformation), guaranteeing an invertible latent-to-transformed-latent mapping while preserving partial input-to-latent equivariance in the transformed latent space; and (2) Exponential-Family Conversion (EF-Conversion) to extend the standard Gaussian prior to an approximate exponential family via a learnable conversion. In experiments on the 3D Cars, 3D Shapes, and dSprites datasets, MIPE-Transformation improves the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHandwritten Text Recognition Techniques