Policy learning under constraint: Maximizing a primary outcome while controlling an adverse event

Laura Fuentes-Vicente; Mathieu Even; Gaelle Dormion; Julie Josse; Antoine Chambaz

arXiv:2601.22717·stat.ME·February 2, 2026

Policy learning under constraint: Maximizing a primary outcome while controlling an adverse event

Laura Fuentes-Vicente, Mathieu Even, Gaelle Dormion, Julie Josse, Antoine Chambaz

PDF

Open Access

TL;DR

This paper introduces PLUC, a method for policy learning that maximizes primary health outcomes while controlling adverse event risks, using convex optimization and iterative algorithms in observational data settings.

Contribution

The paper proposes a novel approach, PLUC, that incorporates adverse event constraints into policy learning by optimizing a convex Lagrangian, with practical implementation and experimental validation.

Findings

01

PLUC effectively balances primary outcomes and adverse event risks.

02

The method outperforms existing approaches in numerical experiments.

03

An R package implementation facilitates practical use.

Abstract

A medical policy aims to support decision-making by mapping patient characteristics to individualized treatment recommendations. Standard approaches typically optimize a single outcome criterion. For example, recommending treatment according to the sign of the Conditional Average Treatment Effect (CATE) maximizes the policy "value" by exploiting treatment effect heterogeneity. This point of view shifts policy learning towards the challenge of learning a reliable CATE estimator. However, in multi-outcome settings, such strategies ignore the risk of adverse events, despite their relevance. PLUC (Policy Learning Under Constraint) addresses this challenges by learning an estimator of the CATE that yields smoothed policies controlling the probability of an adverse event in observational settings. Inspired by insights from EP-learning, PLUC involves the optimization of strongly convex…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Causal Inference Techniques · Statistical Methods and Inference · Risk and Portfolio Optimization