Disentangled Representation Learning for Parametric Partial Differential Equations
Ning Liu, Lu Zhang, Tian Gao, Yue Yu

TL;DR
This paper introduces DisentangO, a neural operator architecture that learns interpretable latent physical factors from neural operator parameters, improving physical understanding and generalization in PDE systems.
Contribution
It proposes a novel hyper-neural operator with disentangled representations, combining multi-task learning and variational autoencoders for interpretability in neural operators.
Findings
Effectively extracts meaningful latent features.
Enhances physical interpretability of neural operators.
Improves generalization across diverse PDE systems.
Abstract
Neural operators (NOs) excel at learning mappings between function spaces, serving as efficient forward solution approximators for PDE-governed systems. However, as black-box solvers, they offer limited insight into the underlying physical mechanism, due to the lack of interpretable representations of the physical parameters that drive the system. To tackle this challenge, we propose a new paradigm for learning disentangled representations from NO parameters, thereby effectively solving an inverse problem. Specifically, we introduce DisentangO, a novel hyper-neural operator architecture designed to unveil and disentangle latent physical factors of variation embedded within the black-box neural operator parameters. At the core of DisentangO is a multi-task NO architecture that distills the varying parameters of the governing PDE through a task-wise adaptive layer, alongside a variational…
Peer Reviews
Decision·ICLR 2026 Poster
The paper is original. The literature review is good. I like the main idea of using hyper-networks to solve forward and inverse problems in an end-to-end fashion. Such methods have potential to significantly advance the field of SciML if scalable enough. The theory part seems adequate and the assumptions and their limitations are discussed well. While not mathematically sophisticated, the results seem interesting and important for the problem considered in the paper. The paper considers several
This paper makes a scientific machine learning (SciML) contribution. As such, it should be accessible to SciML readership. Unfortunately, I do not think this is the case. There is a lot of language and conventions that are unfamiliar to SciML researchers. It seems the paper is mainly written for a core CS/Ml audience. Overall, this paper was very hard to read. I kept going back and re-reading paragraphs. There is a lot of text, but there are often only vague claims of addressing physical interpr
The paper's primary strength is its originality. It shifts the focus of disentanglement from the data space ($u, f$) to the model's parameter space ($\theta_P^\eta$). This is a creative approach to embedding an inverse solver inside the forward solver's parameterization, using the MetaNO lifting layer as the information bottleneck. The inclusion of a theoretical analysis on identifiability (Section 3.2, Theorems 1 and 2) is a major strength. It provides a formal basis for the claim that the lat
The paper's performance claims are unsubstantiated due to improper baselines in the semi-supervised and unsupervised experiments (Exp 2 & 3). The models are benchmarked against standard VAEs that are not designed for the task and fail (e.g., 61.10% error). To be taken seriously as a neural operator, the method must be benchmarked against other SOTA NOs (like FNO, FUSE, etc.), not just its own backbone. The paper's core hypothesis is that disentangling from parameters is the right approach. It n
It combines the predictive accuracy and physical interpretability in a unified ML-based modelling framework, improving the automation level and usage of data in this area.
This is a straightforward application of VAE and Neural Operators for solving inverse problems. The presentation can be improved.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks
