Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care

Prabhjot Singh; Abhishek Gupta; Chris Betz; Abe Flansburg; Brett Ives; Sudeep Lama; Jung Hoon Son

arXiv:2604.28010·cs.LG·May 19, 2026

Learning from Disagreement: Clinician Overrides as Implicit Preference Signals for Clinical AI in Value-Based Care

Prabhjot Singh, Abhishek Gupta, Chris Betz, Abe Flansburg, Brett Ives, Sudeep Lama, Jung Hoon Son

PDF

TL;DR

This paper proposes a novel framework for learning from clinician override data in clinical AI, leveraging implicit preferences and addressing bias to improve decision support in value-based care.

Contribution

It introduces a formal preference learning framework with a taxonomy, capability-conditioned preferences, and a dual learning architecture to enhance clinical AI training.

Findings

01

Override data has favorable properties like longitudinal density and outcome labels.

02

The dual learning architecture prevents suppression bias in recommendations.

03

Training in outcome-based, longitudinal environments aligns AI with patient trajectories.

Abstract

We reframe clinician overrides of clinical AI recommendations as implicit preference data - the same signal structure exploited by reinforcement learning from human feedback (RLHF), but richer: the annotator is a domain expert, the alternatives carry real consequences, and downstream outcomes are observable. We present a formal framework extending standard preference learning with three contributions: a five-category override taxonomy mapping override types to distinct model update targets; a preference formulation conditioned on patient state s, organizational context c, and clinician capability kappa, where kappa decomposes into execution capability kappa-exec and alignment capability kappa-align; and a dual learning architecture that jointly trains a reward model and a capability model via alternating optimization, preventing a failure mode we term suppression bias-the systematic…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.