What Does the AI Doctor Value? Auditing Pluralism in the Clinical Ethics of Language Models

Payal Chandak; Victoria Alkin; David Wu; Maya Dagan; Taposh Dutta Roy; Maria Clara Saad Menezes; Ayush Noori; Nirali Somia; John S. Brownstein; Ran Balicer; Rebecca W. Brendel; Noa Dagan; Isaac S. Kohane; Gabriel A. Brat

arXiv:2605.18738 · cs.AI · May 19, 2026

TL;DR

This paper introduces a framework for auditing value pluralism in medical language models. It reveals systematic biases in the models' value priorities and their potential to undermine the diversity of ethical judgment found in clinical practice.

Contribution

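It presents a benchmark of clinician-verified dilemmas and an attribution method that recovers value priorities directly from decisions, making it possible to analyze and compare the ethical value priorities of large language models in medicine.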

Findings

01 Taken together, frontier models span physician-level heterogeneity in value priorities.

02 Models discuss competing values in their reasoning before committing to a decision.

03 Some models significantly underweight patient autonomy.

Abstract

Medicine is inherently pluralistic. Principles such as autonomy, beneficence, nonmaleficence, and justice routinely conflict, and such ethical dilemmas often sharply divide reasonable physicians. Good clinical practice navigates these tensions in concert with each patient's values rather than imposing a single ethical stance. The ethical values that large language models bring to medical advice, however, have not been systematically examined. We present a framework for auditing value pluralism in medical AI, comprising a benchmark of clinician-verified dilemmas and an attribution method that recovers value priorities directly from decisions. The ecosystem of frontier models spans physician-level value heterogeneity, and models discuss competing values in their reasoning (Overton pluralism) before committing to a decision. However, individual model decisions are near-deterministic across…
