Learning Optimal Policies from Observational Data

Onur Atan; William R. Zame; M van der Schaar

arXiv:1802.08679·cs.AI·February 26, 2018·6 cites

Learning Optimal Policies from Observational Data

Onur Atan, William R. Zame, M van der Schaar

PDF

Open Access

TL;DR

This paper addresses the challenge of learning optimal policies from observational data by providing theoretical bounds on estimation errors and proposing a domain adversarial neural network approach to mitigate selection bias and missing counterfactuals.

Contribution

It introduces a novel theoretical framework connecting counterfactual estimation errors to domain adaptation and develops a principled neural network method for policy optimization.

Findings

01

Theoretical bounds on counterfactual estimation errors are established.

02

Domain adversarial neural networks improve policy learning from observational data.

03

Experimental results demonstrate effectiveness on medical and UCI datasets.

Abstract

Choosing optimal (or at least better) policies is an important problem in domains from medicine to education to finance and many others. One approach to this problem is through controlled experiments/trials - but controlled experiments are expensive. Hence it is important to choose the best policies on the basis of observational data. This presents two difficult challenges: (i) missing counterfactuals, and (ii) selection bias. This paper presents theoretical bounds on estimation errors of counterfactuals from observational data by making connections to domain adaptation theory. It also presents a principled way of choosing optimal policies using domain adversarial neural networks. We illustrate the effectiveness of domain adversarial training together with various features of our algorithm on a semi-synthetic breast cancer dataset and a supervised UCI dataset (Statlog).

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Bayesian Modeling and Causal Inference · Machine Learning and Algorithms

MethodsCounterfactuals Explanations