Scaling Laws for Moral Machine Judgment in Large Language Models

Kazuhiro Takemoto

arXiv:2601.17637·cs.CY·May 4, 2026

Scaling Laws for Moral Machine Judgment in Large Language Models

Kazuhiro Takemoto

PDF

TL;DR

This study demonstrates that larger language models systematically improve in moral judgment alignment with human preferences, following a predictable power-law relationship across diverse architectures.

Contribution

It provides the first empirical evidence of scaling laws governing moral judgment capabilities in large language models, extending scaling law research to value-based tasks.

Findings

01

Moral judgment alignment improves with model size following a power-law.

02

Extended reasoning models show better moral alignment, especially in smaller models.

03

Variance in moral judgment decreases at larger scales, indicating more reliable judgments.

Abstract

Autonomous systems increasingly require moral judgment capabilities, yet whether these capabilities scale predictably with model size remains unexplored. We systematically evaluate 75 large language model configurations (0.27B--1000B parameters) using the Moral Machine framework, measuring alignment with human preferences in life-death dilemmas. We observe a consistent power-law relationship with distance from human preferences ( $D$ ) decreasing as $D \propto S^{- 0.10 \pm 0.01}$ ( $R^{2} = 0.50$ , $p < 0.001$ ) where $S$ is model size. Mixed-effects models confirm this relationship persists after controlling for model family and reasoning capabilities. Extended reasoning models show significantly better alignment, with this effect being more pronounced in smaller models (size $\times$ reasoning interaction: $p = 0.024$ ). The relationship holds across diverse architectures, while variance decreases at…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.