Identifiable Latent Polynomial Causal Models Through the Lens of Change
Yuhang Liu, Zhen Zhang, Dong Gong, Mingming Gong, Biwei Huang, Anton, van den Hengel, Kun Zhang, Javen Qinfeng Shi

TL;DR
This paper extends causal representation learning to nonlinear polynomial models with exponential family noise, providing theoretical identifiability results and a new empirical method validated on synthetic and real data.
Contribution
It generalizes previous linear Gaussian models to nonlinear polynomial causal models with exponential family noise and introduces a novel empirical estimation method.
Findings
Theoretical identifiability results for nonlinear polynomial causal models.
Proposed empirical method achieves consistent learning of latent causal representations.
Experimental validation on synthetic and real-world data supports theoretical claims.
Abstract
Causal representation learning aims to unveil latent high-level causal representations from observed low-level data. One of its primary tasks is to provide reliable assurance of identifying these latent causal models, known as identifiability. A recent breakthrough explores identifiability by leveraging the change of causal influences among latent causal variables across multiple environments \citep{liu2022identifying}. However, this progress rests on the assumption that the causal relationships among latent causal variables adhere strictly to linear Gaussian models. In this paper, we extend the scope of latent causal models to involve nonlinear causal relationships, represented by polynomial models, and general noise distributions conforming to the exponential family. Additionally, we investigate the necessity of imposing changes on all causal parameters and present partial…
Peer Reviews
Decision·ICLR 2024 poster
IMHO the paper's noteworthy strengths are limited to their idea rather than the execution, therefore, they should be considered (where applicable) as counterfactuals for the moment. Said "potential" strengths are considered one-by-one in the following list (the list is ordered in correspondence to the paper presentation): * Causal Representation Learning is an exciting and challenging avenue of research, naturally, extending prior results, especially the generalization sort of results, through p
The paper suffers from several disadvantages, ranging in importance from minor to more fundamental. The more fundamental ones are of major content/technical concern, and given that this review takes a content-centric approach, they weigh the most for the low scoring. The following list - again one-by-one - aims to provide specific pointers with improvement suggestions if applicable (please note, the list is unordered): * Liu et al. 2022 plays a central role in this work, since this work poses a
* The paper is well-written with a clear description of the various assumptions needed for the theoretical results, along with an extremely good proof sketch! Further, the discussion on prior work is quite easy to follow and places the contributions of the work in a good manner as compared to them. * The identification result provided in the paper uses fairly standard assumptions in the literature, but the application for the task of learning the latent causal relationships (not just the latent
* My major concern with the work is regarding the technical contribution and significance of the proposed identification result. The proof uses common ideas of exponential family distribution [1] to obtain identification guarantees up to permutation and scaling for the noise variables. Further, the idea of using the changing distribution of latent variables identified up to polynomial mixing can be used to obtain permutation and scaling identification guarantees has been explored in prior works
- In general the writing and presentation of the relatively clear and easy to understand. - The paper tackles an important and challenging problem of learning latent causal representations from observational data, and makes significant theoretical contributions by extending the scope of latent causal models to nonlinear and non-Gaussian cases, and relaxing the number of required environments. - The paper did not stop after presenting their main results, but also provides in-depth discussions
I appreciate your feedback. Here's an improved version of the weaknesses section: 1. **Latent structure identifiability**: While the paper makes significant strides in latent variable identifiability, it does not discuss the identifiability of the latent structure under the equivalence class. I understand that once the latent variable values are identified, the latent structure can be recovered trivially. However, given that the latent variables are only recovered up to an equivalence class, es
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Advanced Graph Neural Networks · Explainable Artificial Intelligence (XAI)
