# Causal Dantzig: fast inference in linear structural equation models with hidden variables under additive interventions

**Authors:** Dominik Rothenhäusler, Peter Bühlmann, Nicolai Meinshausen

arXiv: 1706.06159 · 2018-06-19

## TL;DR

Causal Dantzig is a computationally efficient method for causal inference in linear structural equation models with hidden variables. It exploits inner-product invariance across environments generated by additive interventions, which lets it scale to large datasets.

## Contribution

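The paper replaces the invariance of the conditional distribution used by Peters et al. [2016] with a different notion, inner-product invariance. Under a more restrictive assumption on the type of interventions, this addresses two shortcomings of the earlier approach: its high computational cost on large-scale data and its assumption that there are no hidden confounders.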

## Key findings

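- Avoids the computationally cumbersome reverse-engineering approach of Peters et al. [2016], so it scales to large datasets
- Gives identifiability conditions for the causal parameter and asymptotic confidence intervals in the low-dimensional setting
- Shows that, when the causal parameter is not identifiable, the solution set of causal Dantzig still has predictive guarantees under certain interventions
- Derives finite-sample bounds in the high-dimensional setting and evaluates performance on simulated datasets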

## Abstract

Causal inference is known to be very challenging when only observational data are available. Randomized experiments are often costly and impractical, and in instrumental variable regression the number of instruments has to exceed the number of causal predictors. It was recently shown in Peters et al. [2016] that causal inference for the full model is possible when data from distinct observational environments are available, exploiting that the conditional distribution of a response variable is invariant under the correct causal model. Two shortcomings of such an approach are the high computational effort for large-scale data and the assumed absence of hidden confounders. Here we show that these two shortcomings can be addressed if one is willing to make a more restrictive assumption on the type of interventions that generate different environments. Thereby, we look at a different notion of invariance, namely inner-product invariance. By avoiding a computationally cumbersome reverse-engineering approach such as in Peters et al. [2016], it allows for large-scale causal inference in linear structural equation models. We discuss identifiability conditions for the causal parameter and derive asymptotic confidence intervals in the low-dimensional setting. In the case of non-identifiability, we show that the solution set of causal Dantzig has predictive guarantees under certain interventions. We derive finite-sample bounds in the high-dimensional setting and investigate its performance on simulated datasets.
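To make inner-product invariance concrete, here is a minimal NumPy sketch, not the authors' implementation. It assumes two environments, a single hidden confounder, additive shift interventions that act only on the predictors, and an invertible difference of Gram matrices. The function names `simulate` and `causal_dantzig_unregularized`, the simulation design, and all numeric settings are illustrative assumptions. The idea is that if E[X(Y − Xβ)] is the same in both environments, then the difference of the cross-moments equals the difference of the Gram matrices times β, and β can be recovered by one linear solve.

```python
import numpy as np


def simulate(n, shift_scale, beta, rng):
    """Draw one environment from a toy linear SEM (illustrative assumption).

    H is a hidden confounder affecting both X and Y. The environment differs
    only through an additive random shift on X with scale `shift_scale`.
    """
    p = beta.shape[0]
    H = rng.normal(size=n)                          # hidden confounder
    shift = shift_scale * rng.normal(size=(n, p))   # additive intervention on X
    X = H[:, None] + rng.normal(size=(n, p)) + shift
    Y = X @ beta + 2.0 * H + rng.normal(size=n)
    return X, Y


def causal_dantzig_unregularized(X1, Y1, X2, Y2):
    """Solve G beta = Z, where G and Z are differences of empirical moments.

    Under inner-product invariance, E[X(Y - X beta)] is equal in both
    environments, so the confounding term cancels in the difference.
    """
    n1, n2 = X1.shape[0], X2.shape[0]
    G = X1.T @ X1 / n1 - X2.T @ X2 / n2   # difference of Gram matrices
    Z = X1.T @ Y1 / n1 - X2.T @ Y2 / n2   # difference of cross-moments
    return np.linalg.solve(G, Z)


rng = np.random.default_rng(0)
beta = np.array([1.0, -0.5, 0.0])                 # true causal coefficients
X1, Y1 = simulate(200_000, 0.0, beta, rng)        # observational environment
X2, Y2 = simulate(200_000, 2.0, beta, rng)        # shifted environment

beta_hat = causal_dantzig_unregularized(X1, Y1, X2, Y2)

# Pooled least squares for comparison; it is biased by the hidden confounder.
X_pool = np.vstack([X1, X2])
Y_pool = np.concatenate([Y1, Y2])
beta_ols = np.linalg.lstsq(X_pool, Y_pool, rcond=None)[0]

print("causal Dantzig sketch:", np.round(beta_hat, 3))
print("pooled least squares: ", np.round(beta_ols, 3))
```

In this toy setup the moment-difference estimate should land close to the true coefficients, while pooled least squares is pulled away by the confounder. The exact solve needs the two environments to differ enough that the Gram-matrix difference is well conditioned. For the high-dimensional setting, the name suggests a Dantzig-selector-style relaxation of this linear system (an ℓ1 objective under a sup-norm constraint), which is not shown here.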

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1706.06159/full.md

## Figures

23 figures with captions in the complete paper: https://tomesphere.com/paper/1706.06159/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/1706.06159/full.md

---
Source: https://tomesphere.com/paper/1706.06159