Adversarial Orthogonal Regression: Two non-Linear Regressions for Causal Inference
M. Reza Heydari, Saber Salehkaleybar, Kun Zhang

TL;DR
This paper introduces two adversarial nonlinear regression methods, AdOR and AdOSE, for causal inference that do not assume noise distribution and effectively learn causal structures through minimax training.
Contribution
The paper presents novel adversarial regression techniques, AdOR and AdOSE, capable of causal inference without noise assumptions, advancing causal structure learning methods.
Findings
Both methods outperform previous solutions in synthetic experiments.
AdOR accurately estimates mutual information between residuals and inputs.
AdOSE effectively models conditional distributions in causal models.
Abstract
We propose two nonlinear regression methods, named Adversarial Orthogonal Regression (AdOR) for additive noise models and Adversarial Orthogonal Structural Equation Model (AdOSE) for the general case of structural equation models. Both methods try to make the residual of regression independent from regressors while putting no assumption on noise distribution. In both methods, two adversarial networks are trained simultaneously where a regression network outputs predictions and a loss network that estimates mutual information (in AdOR) and KL-divergence (in AdOSE). These methods can be formulated as a minimax two-player game; at equilibrium, AdOR finds a deterministic map between inputs and output and estimates mutual information between residual and inputs, while AdOSE estimates a conditional probability distribution of output given inputs. The proposed methods can be used as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Statistical Methods and Models · Statistical Methods and Inference · Gaussian Processes and Bayesian Inference
