Morality is Contextual: Learning Interpretable Moral Contexts from Human Data with Probabilistic Clustering and Large Language Models

Geoffroy Morlat; Marceau Nahon; Augustin Chartouny; Raja Chatila; Ismael T. Freire; Mehdi Khamassi

arXiv:2512.21439·cs.CL·December 29, 2025

Morality is Contextual: Learning Interpretable Moral Contexts from Human Data with Probabilistic Clustering and Large Language Models

Geoffroy Morlat, Marceau Nahon, Augustin Chartouny, Raja Chatila, Ismael T. Freire, Mehdi Khamassi

PDF

Open Access 3 Reviews

TL;DR

This paper introduces COMETH, a framework that models how context influences moral judgments by combining human data, probabilistic clustering, and large language models, resulting in more accurate and interpretable moral predictions.

Contribution

It presents a new dataset, a reproducible pipeline for moral context learning, and an interpretable model that improves alignment with human judgments over standard LLM prompting.

Findings

01

COMETH roughly doubles alignment with human judgments compared to LLM prompting.

02

The framework identifies key contextual features influencing moral judgments.

03

It provides transparent explanations for moral predictions.

Abstract

Moral actions are judged not only by their outcomes but by the context in which they occur. We present COMETH (Contextual Organization of Moral Evaluation from Textual Human inputs), a framework that integrates a probabilistic context learner with LLM-based semantic abstraction and human moral evaluations to model how context shapes the acceptability of ambiguous actions. We curate an empirically grounded dataset of 300 scenarios across six core actions (violating Do not kill, Do not deceive, and Do not break the law) and collect ternary judgments (Blame/Neutral/Support) from N=101 participants. A preprocessing pipeline standardizes actions via an LLM filter and MiniLM embeddings with K-means, producing robust, reproducible core-action clusters. COMETH then learns action-specific moral contexts by clustering scenarios online from human judgment distributions using principled divergence…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 0Confidence 4

Strengths

N/A

Weaknesses

This paper exploits the conference submission format by substantially shrinking the page margins. Hence, I will desk reject the paper for the severe format violation.

Reviewer 02Rating 6Confidence 3

Strengths

The paper offers an original reframing of context‑sensitive moral evaluation. Rather than predicting a single label from text, it learns action‑specific moral “context models” by clustering scenarios according to the empirical distributions of human judgments, with the number of contexts per action emerging from the data. It then generalizes and explains these contexts via an interpretable module that uses LLM‑derived, non‑evaluative binary features and learns feature weights to assign new scena

Weaknesses

The study pools 101 raters split into six groups (50 scenarios each; so about 16–17 raters per scenario) with items presented in multiple languages and with demographic information collected. Yet the modeling and evaluation are fully pooled, with no stratification by language or demographics as provided in the survey materials (Appendix A.1). In a setting where moral judgments are culturally sensitive, as the annotator pool per scenario increases, the empirical blame/neutral/support distribution

Reviewer 03Rating 2Confidence 3

Strengths

A dataset with different scenarios and actions is created. The proposed framework achieves better performance w.r.t. alignment rates, compared to baseline end-to-end methods. The idea of contextual moral evaluation and alignment is novel.

Weaknesses

The experimental evaluations are weak. The evaluations on clustering are hard to tell whether the resultant clusters are corresponding to different contexts. The claimed interpretability is also lacking support. Figure 5 has two columns with the same feature of "approved directive(s)", but the contributions to assigning scenarios to contexts (C1 & C2) are different. According to the assigning weights, the contexts C1 and C2 are quite different. However, L366 mentioned "scenarios mentioning an

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExplainable Artificial Intelligence (XAI) · Psychology of Moral and Emotional Judgment · Ethics and Social Impacts of AI