Safe Policy Learning through Extrapolation: Application to Pre-trial Risk Assessment
Eli Ben-Michael, D. James Greiner, Kosuke Imai, Zhichao Jiang

TL;DR
This paper introduces a robust optimization method to improve deterministic pre-trial risk assessment policies in the US criminal justice system, ensuring safety and limited downside risk while enhancing certain risk classifications.
Contribution
It develops a maximin robust optimization approach for policy learning with deterministic policies, addressing the challenge of policy improvement without stochasticity.
Findings
Certain risk assessment components can be safely improved.
The method provides a safety guarantee under structural assumptions.
Some components remain inconclusive for improvement.
Abstract
Algorithmic recommendations and decisions have become ubiquitous in today's society. Many of these data-driven policies, especially in the realm of public policy, are based on known, deterministic rules to ensure their transparency and interpretability. We examine a particular case of algorithmic pre-trial risk assessments in the US criminal justice system, which provide deterministic classification scores and recommendations to help judges make release decisions. Our goal is to analyze data from a unique field experiment on an algorithmic pre-trial risk assessment to investigate whether the scores and recommendations can be improved. Unfortunately, prior methods for policy learning are not applicable because they require existing policies to be stochastic. We develop a maximin robust optimization approach that partially identifies the expected utility of a policy, and then finds a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
