Predicting Long Term Sequential Policy Value Using Softer Surrogates

Hyunji Nam; Allen Nie; Ge Gao; Vasilis Syrgkanis; Emma Brunskill

arXiv:2412.20638·cs.AI·February 4, 2025

Predicting Long Term Sequential Policy Value Using Softer Surrogates

Hyunji Nam, Allen Nie, Ge Gao, Vasilis Syrgkanis, Emma Brunskill

PDF

Open Access

TL;DR

This paper introduces a method to predict the long-term outcomes of policies in healthcare by combining short-term on-policy data with long-term historical data, addressing challenges in off-policy evaluation with novel actions.

Contribution

It proposes a novel surrogacy-based approach for long-term policy value prediction using short-term data, applicable when new actions are introduced.

Findings

01

Accurate long-term policy value predictions with only 10% of data observed.

02

Method effective in simulated healthcare scenarios like HIV and sepsis management.

03

Finite sample analysis supports estimator robustness.

Abstract

Off-policy policy evaluation (OPE) estimates the outcome of a new policy using historical data collected from a different policy. However, existing OPE methods cannot handle cases when the new policy introduces novel actions. This issue commonly occurs in real-world domains, like healthcare, as new drugs and treatments are continuously developed. Novel actions necessitate on-policy data collection, which can be burdensome and expensive if the outcome of interest takes a substantial amount of time to observe--for example, in multi-year clinical trials. This raises a key question of how to predict the long-term outcome of a policy after only observing its short-term effects? Though in general this problem is intractable, under some surrogacy conditions, the short-term on-policy data can be combined with the long-term historical data to make accurate predictions about the new policy's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMonetary Policy and Economic Impact