Beating the Winner's Curse via Inference-Aware Policy Optimization

Hamsa Bastani; Osbert Bastani; Bryce McLaughlin

arXiv:2510.18161·stat.ML·February 10, 2026

Beating the Winner's Curse via Inference-Aware Policy Optimization

Hamsa Bastani, Osbert Bastani, Bryce McLaughlin

PDF

Open Access

TL;DR

This paper introduces inference-aware policy optimization, a method that balances estimated policy improvements with the likelihood of passing significance tests, thereby reducing the winner's curse in policy learning.

Contribution

It proposes a novel optimization strategy that accounts for downstream evaluation significance, characterizes the Pareto frontier of tradeoffs, and enables better policy selection.

Findings

01

Effectively reduces winner's curse in policy optimization.

02

Balances estimated improvements with statistical significance.

03

Demonstrates improved policy evaluation in simulations.

Abstract

There has been a surge of recent interest in automatically learning policies to target treatment decisions based on rich individual covariates. In addition, practitioners want confidence that the learned policy has better performance than the incumbent policy according to downstream policy evaluation. However, due to the winner's curse -- an issue where the policy optimization procedure exploits prediction errors rather than finding actual improvements -- predicted performance improvements are often not substantiated by downstream policy evaluation. To address this challenge, we propose a novel strategy called inference-aware policy optimization, which modifies policy optimization to account for how the policy will be evaluated downstream. Specifically, it optimizes not only for the estimated objective value, but also for the chances that the estimate of the policy's improvement passes…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Machine Learning in Healthcare · Machine Learning and Data Classification