Learning Decision Policies with Instrumental Variables through Double Machine Learning

Daqian Shao; Ashkan Soleymani; Francesco Quinzan; Marta Kwiatkowska

arXiv:2405.08498·cs.LG·June 25, 2025

Learning Decision Policies with Instrumental Variables through Double Machine Learning

Daqian Shao, Ashkan Soleymani, Francesco Quinzan, Marta Kwiatkowska

PDF

Open Access 1 Repo

TL;DR

This paper introduces DML-IV, a novel non-linear instrumental variable regression method that reduces bias in two-stage models, enabling more accurate causal inference and policy learning in confounded data settings.

Contribution

The paper proposes DML-IV, a bias-reducing, double machine learning-based IV regression approach that improves causal effect estimation and policy learning in the presence of confounders.

Findings

01

DML-IV achieves strong convergence and $O(N^{-1/2})$ suboptimality guarantees.

02

DML-IV outperforms existing IV regression methods on benchmarks.

03

DML-IV learns high-performing policies despite confounding and instrumental variables.

Abstract

A common issue in learning decision-making policies in data-rich settings is spurious correlations in the offline dataset, which can be caused by hidden confounders. Instrumental variable (IV) regression, which utilises a key unconfounded variable known as the instrument, is a standard technique for learning causal relationships between confounded action, outcome, and context variables. Most recent IV regression algorithms use a two-stage approach, where a deep neural network (DNN) estimator learnt in the first stage is directly plugged into the second stage, in which another DNN is used to estimate the causal effect. Naively plugging the estimator can cause heavy bias in the second stage, especially when regularisation bias is present in the first stage estimator. We propose DML-IV, a non-linear IV regression method that reduces the bias in two-stage IV regressions and effectively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

shaodaqian/DML-IV
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Neural Networks and Applications