Language Models are Alignable Decision-Makers: Dataset and Application   to the Medical Triage Domain

Brian Hu; Bill Ray; Alice Leung; Amy Summerville; David Joy,; Christopher Funk; Arslan Basharat

arXiv:2406.06435·cs.CL·June 11, 2024

Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain

Brian Hu, Bill Ray, Alice Leung, Amy Summerville, David Joy,, Christopher Funk, Arslan Basharat

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a new dataset and software framework demonstrating how large language models can be aligned with ethical decision-making attributes in medical triage, improving trustworthy AI applications.

Contribution

It presents a novel dataset with decision-maker attributes for medical triage and a framework for aligning LLM decisions to these attributes using zero-shot prompting.

Findings

01

LLMs can serve as ethical decision-makers in medical triage.

02

Weighted self-consistency improves model performance.

03

Open-source models like Falcon, Mistral, and Llama 2 can be aligned with DMAs.

Abstract

In difficult decision-making scenarios, it is common to have conflicting opinions among expert human decision-makers as there may not be a single right answer. Such decisions may be guided by different attributes that can be used to characterize an individual's decision. We introduce a novel dataset for medical triage decision-making, labeled with a set of decision-maker attributes (DMAs). This dataset consists of 62 scenarios, covering six different DMAs, including ethical principles such as fairness and moral desert. We present a novel software framework for human-aligned decision-making by utilizing these DMAs, paving the way for trustworthy AI with better guardrails. Specifically, we demonstrate how large language models (LLMs) can serve as ethical decision-makers, and how their decisions can be aligned to different DMAs using zero-shot prompting. Our experiments focus on different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

itm-kitware/llm-alignable-dm
pytorchOfficial

Videos

Language Models are Alignable Decision-Makers: Dataset and Application to the Medical Triage Domain· underline

Taxonomy

TopicsMachine Learning in Healthcare · Topic Modeling

MethodsSparse Evolutionary Training · Focus · LLaMA