Large-scale moral machine experiment on large language models
Muhammad Shahrul Zaim bin Ahmad, Kazuhiro Takemoto

TL;DR
This study evaluates 52 large language models' moral decision-making in autonomous driving, revealing that larger models tend to align more with human ethics, but updates and ethical emphasis vary, impacting practical deployment.
Contribution
It provides a comprehensive comparison of diverse LLMs' moral judgments, analyzing factors like size, updates, and architecture, to inform ethical AI development in autonomous systems.
Findings
Proprietary and large open-source models align closely with human moral judgments.
Model size negatively correlates with distance from human preferences in open-source models.
Model updates do not consistently improve moral judgment alignment.
Abstract
The rapid advancement of Large Language Models (LLMs) and their potential integration into autonomous driving systems necessitates understanding their moral decision-making capabilities. While our previous study examined four prominent LLMs using the Moral Machine experimental framework, the dynamic landscape of LLM development demands a more comprehensive analysis. Here, we evaluate moral judgments across 52 different LLMs, including multiple versions of proprietary models (GPT, Claude, Gemini) and open-source alternatives (Llama, Gemma), to assess their alignment with human moral preferences in autonomous driving scenarios. Using a conjoint analysis framework, we evaluated how closely LLM responses aligned with human preferences in ethical dilemmas and examined the effects of model size, updates, and architecture. Results showed that proprietary models and open-source models exceeding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection
