Reasoning or Rhetoric? An Empirical Analysis of Moral Reasoning Explanations in Large Language Models

Aryan Kasat; Smriti Singh; Aman Chadha; Vinija Jain

arXiv:2603.21854·cs.AI·March 24, 2026

Open Access

TL;DR

This study investigates whether large language models genuinely reason morally or merely mimic moral reasoning. It finds that they often produce superficially mature responses that lack developmental progression and logical coherence.

Contribution

The paper provides an empirical analysis showing that LLMs mimic mature moral reasoning without progressing through the underlying developmental stages, and it highlights systematic inconsistencies and the phenomenon of moral ventriloquism.

Findings

01 Responses mostly align with post-conventional reasoning regardless of model specifics.

02 Models exhibit moral decoupling: their justifications and actions are inconsistent with each other.

03 Cross-dilemma responses are highly consistent and logically indistinguishable.

Abstract

Do large language models reason morally, or do they merely sound like they do? We investigate whether LLM responses to moral dilemmas exhibit genuine developmental progression through Kohlberg's stages of moral development, or whether alignment training instead produces reasoning-like outputs that superficially resemble mature moral judgment without the underlying developmental trajectory. Using an LLM-as-judge scoring pipeline validated across three judge models, we classify more than 600 responses from 13 LLMs spanning a range of architectures, parameter scales, and training regimes across six classical moral dilemmas, and conduct ten complementary analyses to characterize the nature and internal coherence of the resulting patterns. Our results reveal a striking inversion: responses overwhelmingly correspond to post-conventional reasoning (Stages 5-6) regardless of model size,…

Taxonomy

Topics: Psychology of Moral and Emotional Judgment · Child and Animal Learning Development · Ethics and Social Impacts of AI