Large-scale moral machine experiment on large language models

Muhammad Shahrul Zaim bin Ahmad; Kazuhiro Takemoto

arXiv:2411.06790·cs.CY·May 23, 2025·2 cites

Large-scale moral machine experiment on large language models

Muhammad Shahrul Zaim bin Ahmad, Kazuhiro Takemoto

PDF

Open Access 1 Repo

TL;DR

This study evaluates 52 large language models' moral decision-making in autonomous driving, revealing that larger models tend to align more with human ethics, but updates and ethical emphasis vary, impacting practical deployment.

Contribution

It provides a comprehensive comparison of diverse LLMs' moral judgments, analyzing factors like size, updates, and architecture, to inform ethical AI development in autonomous systems.

Findings

01

Proprietary and large open-source models align closely with human moral judgments.

02

Model size negatively correlates with distance from human preferences in open-source models.

03

Model updates do not consistently improve moral judgment alignment.

Abstract

The rapid advancement of Large Language Models (LLMs) and their potential integration into autonomous driving systems necessitates understanding their moral decision-making capabilities. While our previous study examined four prominent LLMs using the Moral Machine experimental framework, the dynamic landscape of LLM development demands a more comprehensive analysis. Here, we evaluate moral judgments across 52 different LLMs, including multiple versions of proprietary models (GPT, Claude, Gemini) and open-source alternatives (Llama, Gemma), to assess their alignment with human moral preferences in autonomous driving scenarios. Using a conjoint analysis framework, we evaluated how closely LLM responses aligned with human preferences in ethical dilemmas and examined the effects of model size, updates, and architecture. Results showed that proprietary models and open-source models exceeding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

kztakemoto/mmllm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection