Confronting Label Indeterminacy in Automated Bail Decisions

Cor Steging; Tadeusz Zbiegie\'n

arXiv:2605.04073·cs.LG·May 7, 2026

Confronting Label Indeterminacy in Automated Bail Decisions

Cor Steging, Tadeusz Zbiegie\'n

PDF

TL;DR

This paper examines how to handle uncertain labels in machine learning models for bail decisions, highlighting the impact of different approaches on model behavior and legal legitimacy.

Contribution

It introduces a novel label imputation method and evaluates five approaches to address label indeterminacy in bail decision data.

Findings

01

All methods influence model predictions significantly.

02

Model choice impacts results more than label handling approach.

03

Explainable AI reveals effects on internal decision processes.

Abstract

Bail decisions present a fundamental challenge for data-driven decision support systems. When bail is denied, the counterfactual outcome of whether the defendant would have appeared in court remains unobserved. As a result, historical bail data embed structural label indeterminacy: future decisions are influenced by past decisions whose outcomes are only partially knowable. Building automated systems on such data risks introducing bias and reinforcing feedback loops. This raises a core question for machine-learning systems intended to assist judicial actors: how should cases in which bail was denied be treated during model development? In a case study of bail decisions from the Unified Judicial System of Pennsylvania, we evaluate five contemporary approaches to handling label indeterminacy across three machine learning models, including a novel label imputation method motivated by the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.