Physically Interpretable Representation Learning with Gaussian Mixture Variational AutoEncoder (GM-VAE)

Tiffany Fan; Murray Cutforth; Marta D'Elia; Alexandre Cortiella; Alireza Doostan; Eric Darve

arXiv:2511.21883·cs.LG·December 1, 2025

Open Access

TL;DR

This paper introduces GM-VAE, a Gaussian mixture variational autoencoder framework that combines an EM-inspired training scheme with a spectral interpretability metric to learn stable, physically meaningful representations of complex scientific data.

Contribution

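The paper presents GM-VAE, trained with a block-coordinate descent method that alternates between expectation and maximization steps, together with a new interpretability metric based on graph-Laplacian smoothness. The authors report improved physical interpretability and training stability over conventional VAEs.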
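
To illustrate what block-coordinate, EM-style alternation means, here is a minimal sketch on a plain one-dimensional Gaussian mixture. It is not the paper's GM-VAE: there is no encoder or decoder, and the function names (`e_step`, `m_step`, `fit_gmm_1d`), the quantile-based initialization, and the variance floor are assumptions made for this example. The point is only the structure: one block of variables (cluster responsibilities) is updated while the other (mixture parameters) is held fixed, and then the roles swap.

```python
import numpy as np

def e_step(x, weights, means, variances):
    """Responsibilities r[i, k] = p(cluster k | x_i), with parameters held fixed."""
    # log(weight_k * N(x_i | mean_k, var_k)), kept in log space for stability.
    log_p = (np.log(weights)
             - 0.5 * np.log(2.0 * np.pi * variances)
             - 0.5 * (x[:, None] - means) ** 2 / variances)
    log_p -= log_p.max(axis=1, keepdims=True)
    r = np.exp(log_p)
    return r / r.sum(axis=1, keepdims=True)

def m_step(x, r):
    """Closed-form parameter update, with responsibilities held fixed."""
    nk = r.sum(axis=0)
    weights = nk / x.shape[0]
    means = (r * x[:, None]).sum(axis=0) / nk
    # Small floor (an assumption of this sketch) keeps variances positive.
    variances = (r * (x[:, None] - means) ** 2).sum(axis=0) / nk + 1e-6
    return weights, means, variances

def fit_gmm_1d(x, n_components=2, n_iters=50):
    """Alternate the two blocks for a fixed number of iterations."""
    # Deterministic initialization (hypothetical choice): means at data quantiles.
    means = np.quantile(x, np.linspace(0.25, 0.75, n_components))
    variances = np.full(n_components, x.var())
    weights = np.full(n_components, 1.0 / n_components)
    for _ in range(n_iters):
        r = e_step(x, weights, means, variances)
        weights, means, variances = m_step(x, r)
    # Final responsibilities under the final parameters.
    r = e_step(x, weights, means, variances)
    return weights, means, variances, r
```

In a GM-VAE the maximization block would presumably involve gradient updates of network weights rather than closed-form formulas, but that detail is not given on this page.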

Findings

01  GM-VAE produces smooth, physically consistent latent manifolds.

02  The method accurately clusters physical regimes in complex datasets.

03  Training stability is enhanced compared to conventional VAEs.

Abstract

Extracting compact, physically interpretable representations from high-dimensional scientific data is a persistent challenge due to the complex, nonlinear structures inherent in physical systems. We propose a Gaussian Mixture Variational Autoencoder (GM-VAE) framework designed to address this by integrating an Expectation-Maximization (EM)-inspired training scheme with a novel spectral interpretability metric. Unlike conventional VAEs that jointly optimize reconstruction and clustering (often leading to training instability), our method utilizes a block-coordinate descent strategy, alternating between expectation and maximization steps. This approach stabilizes training and naturally aligns latent clusters with distinct physical regimes. To objectively evaluate the learned representations, we introduce a quantitative metric based on graph-Laplacian smoothness, which measures the…
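
The abstract is cut off before the metric is defined, so the paper's exact formulation is not available here. As a rough illustration of what graph-Laplacian smoothness generally measures, the sketch below uses the standard Rayleigh quotient f^T L f / f^T f on a k-nearest-neighbour graph built over latent coordinates. The function names, the unweighted kNN graph, the unnormalized Laplacian, and the choice of k are all assumptions for this example, not the authors' method. A lower score means the physical quantity f varies more smoothly across neighbouring latent points.

```python
import numpy as np

def knn_adjacency(z, k=5):
    """Symmetric, unweighted k-nearest-neighbour adjacency over latent points z (n, d)."""
    n = z.shape[0]
    d2 = ((z[:, None, :] - z[None, :, :]) ** 2).sum(axis=-1)
    np.fill_diagonal(d2, np.inf)             # exclude self-neighbours
    nbrs = np.argsort(d2, axis=1)[:, :k]     # indices of the k closest points
    w = np.zeros((n, n))
    rows = np.repeat(np.arange(n), k)
    w[rows, nbrs.ravel()] = 1.0
    return np.maximum(w, w.T)                # symmetrize

def laplacian_smoothness(z, f, k=5):
    """Rayleigh quotient of the centered signal f on the kNN graph of z.

    f^T L f equals the sum over undirected edges of (f_i - f_j)^2,
    so small values mean f changes little between latent neighbours.
    Undefined for a constant f (the denominator is zero).
    """
    w = knn_adjacency(z, k)
    lap = np.diag(w.sum(axis=1)) - w         # unnormalized Laplacian L = D - W
    f = f - f.mean()
    return float(f @ lap @ f / (f @ f))
```

As a sanity check under these assumptions, a quantity that is a smooth function of the latent coordinates should score lower than the same values randomly shuffled across the points.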

Taxonomy

Topics: Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis · Gaussian Processes and Bayesian Inference