Adjustment for Confounding using Pre-Trained Representations

Rickmer Schulte; David R\"ugamer; Thomas Nagler

arXiv:2506.14329·stat.ML·October 23, 2025

Adjustment for Confounding using Pre-Trained Representations

Rickmer Schulte, David R\"ugamer, Thomas Nagler

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper explores how pre-trained neural network features can be used to adjust for confounding in causal inference with non-tabular data, addressing challenges of high dimensionality and non-identifiability.

Contribution

It formalizes conditions under which latent features from neural networks enable valid ATE adjustment and inference, highlighting neural networks' ability to adapt to sparsity and intrinsic data structure.

Findings

01

Neural networks can achieve fast convergence rates for latent feature learning.

02

Latent features enable valid adjustment for confounding in non-tabular data.

03

Structural assumptions for linear models are unrealistic for neural network features.

Abstract

There is growing interest in extending average treatment effect (ATE) estimation to incorporate non-tabular data, such as images and text, which may act as sources of confounding. Neglecting these effects risks biased results and flawed scientific conclusions. However, incorporating non-tabular data necessitates sophisticated feature extractors, often in combination with ideas of transfer learning. In this work, we investigate how latent features from pre-trained neural networks can be leveraged to adjust for sources of confounding. We formalize conditions under which these latent features enable valid adjustment and statistical inference in ATE estimation, demonstrating results along the example of double machine learning. We discuss critical challenges inherent to latent feature learning and downstream parameter estimation arising from the high dimensionality and non-identifiability…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

rickmer-schulte/pretrained-causal-adjust
pytorchOfficial

Videos

Adjustment for Confounding using Pre-Trained Representations· slideslive

Taxonomy

TopicsAdvanced Causal Inference Techniques