Risk-Controlled Post-Processing of Decision Policies
Sunay Joshi, Tao Wang, Hamed Hassani, Edgar Dobriban

TL;DR
This paper introduces a risk-controlled post-processing method for decision policies that maximizes agreement with a baseline while satisfying risk constraints, with theoretical guarantees and practical experiments.
Contribution
It develops a threshold-based post-processing algorithm with provable risk control and excess risk bounds, applicable to various decision-making tasks.
Findings
The optimal policy follows the baseline except where it significantly reduces violation risk.
The algorithm achieves an expected excess risk of O(log n/n) under regularity conditions.
Experiments show the method effectively meets risk budgets while maintaining baseline agreement.
Abstract
Predictive models are often deployed through existing decision policies that stakeholders are reluctant to change unless a risk constraint requires intervention. We study risk-controlled post-processing: given a deterministic baseline policy, choose a new policy that maximizes agreement with the baseline subject to a chance constraint on a user-specified loss. At the population level, we show that the optimal policy has a threshold structure: it follows the baseline except on contexts where switching to the oracle fallback policy yields a large reduction in conditional violation risk. At the finite-sample level, given a fitted fallback policy and score, we develop a post-processing algorithm that uses calibration data to select a threshold. Leveraging tools from algorithmic stability and stochastic processes, we show that under regularity conditions, in the i.i.d. setting, the expected…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
