Conformal Policy Control

Drew Prinster; Clara Fannjiang; Ji Won Park; Kyunghyun Cho; Anqi Liu; Suchi Saria; Samuel Stanton

arXiv:2603.02196·cs.AI·April 17, 2026

Conformal Policy Control

Drew Prinster, Clara Fannjiang, Ji Won Park, Kyunghyun Cho, Anqi Liu, Suchi Saria, Samuel Stanton

PDF

TL;DR

This paper introduces a conformal calibration method that safely regulates policy exploration in high-stakes environments, ensuring risk constraints are met while allowing for performance improvements.

Contribution

It provides a novel, theory-backed approach to safe policy exploration using conformal calibration without requiring model class assumptions or hyperparameter tuning.

Findings

01

Safe exploration is feasible from deployment start.

02

The method provably enforces user-defined risk tolerances.

03

Experiments show performance improvements in diverse applications.

Abstract

An agent must try new behaviors to explore and improve. In high-stakes environments, an agent that violates safety constraints may cause harm and must be taken offline, curtailing any future interaction. Imitating old behavior is safe, but excessive conservatism discourages exploration. How much behavior change is too much? We show how to use any safe reference policy as a probabilistic regulator for any optimized but untested policy. Conformal calibration on data from the safe policy determines how aggressively the new policy can act, while provably enforcing the user's declared risk tolerance. Unlike conservative optimization methods, we do not assume the user has identified the correct model class nor tuned any hyperparameters. Unlike previous conformal methods, our theory provides finite-sample guarantees even for non-monotonic bounded loss functions. Our experiments on applications…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.