Conservative Policy Construction Using Variational Autoencoders for   Logged Data with Missing Values

Mahed Abroshan; Kai Hou Yip; Cem Tekin; Mihaela van der Schaar

arXiv:2109.03747·cs.LG·March 10, 2022

Conservative Policy Construction Using Variational Autoencoders for Logged Data with Missing Values

Mahed Abroshan, Kai Hou Yip, Cem Tekin, Mihaela van der Schaar

PDF

TL;DR

This paper develops a conservative policy construction method using variational autoencoders to handle missing data in logged datasets for high-stakes decision making, ensuring safety and robustness.

Contribution

It introduces a novel conservative policy approach utilizing partial variational autoencoders to effectively manage missing feature data in observational datasets.

Findings

01

Successfully estimates posterior distributions with PVAE

02

Demonstrates safe policy recommendations with missing data

03

Improves decision-making robustness in healthcare applications

Abstract

In high-stakes applications of data-driven decision making like healthcare, it is of paramount importance to learn a policy that maximizes the reward while avoiding potentially dangerous actions when there is uncertainty. There are two main challenges usually associated with this problem. Firstly, learning through online exploration is not possible due to the critical nature of such applications. Therefore, we need to resort to observational datasets with no counterfactuals. Secondly, such datasets are usually imperfect, additionally cursed with missing values in the attributes of features. In this paper, we consider the problem of constructing personalized policies using logged data when there are missing values in the attributes of features in both training and test data. The goal is to recommend an action (treatment) when $\Xt$ , a degraded version of $\Xb$ with missing values, is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.