Right vs. Right: Can LLMs Make Tough Choices?

Jiaqing Yuan; Pradeep K. Murukannaiah; Munindar P. Singh

arXiv:2412.19926·cs.CL·December 31, 2024

Right vs. Right: Can LLMs Make Tough Choices?

Jiaqing Yuan, Pradeep K. Murukannaiah, Munindar P. Singh

PDF

Open Access

TL;DR

This paper evaluates how large language models handle ethical dilemmas, revealing their preferences, consistency, and limitations in moral reasoning and response alignment.

Contribution

It introduces a new dataset of 1,730 ethical dilemmas and systematically assesses LLMs' moral sensitivity, consistency, and influence of explicit guidelines.

Findings

01

LLMs show preferences for certain moral values.

02

Larger LLMs tend to support deontological ethics.

03

Explicit guidelines improve moral alignment.

Abstract

An ethical dilemma describes a choice between two "right" options involving conflicting moral values. We present a comprehensive evaluation of how LLMs navigate ethical dilemmas. Specifically, we investigate LLMs on their (1) sensitivity in comprehending ethical dilemmas, (2) consistency in moral value choice, (3) consideration of consequences, and (4) ability to align their responses to a moral value preference explicitly or implicitly specified in a prompt. Drawing inspiration from a leading ethical framework, we construct a dataset comprising 1,730 ethical dilemmas involving four pairs of conflicting values. We evaluate 20 well-known LLMs from six families. Our experiments reveal that: (1) LLMs exhibit pronounced preferences between major value pairs, and prioritize truth over loyalty, community over individual, and long-term over short-term considerations. (2) The larger LLMs tend…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLegal Systems and Judicial Processes

MethodsALIGN