Narrative over Numbers: The Identifiable Victim Effect and its Amplification Under Alignment and Reasoning in Large Language Models
Syed Rifat Raiyan

TL;DR
This study empirically investigates the Identifiable Victim Effect in large language models, revealing how alignment and reasoning influence the effect's magnitude and implications for AI ethics.
Contribution
First large-scale empirical analysis of IVE in LLMs, showing how alignment training and reasoning methods modulate the effect.
Findings
Alignment training amplifies IVE in instruction-tuned models.
Reasoning-specialized models can invert the IVE.
Chain-of-Thought prompting increases IVE, while utilitarian CoT reduces it.
Abstract
The Identifiable Victim Effect (IVE) the tendency to allocate greater resources to a specific, narratively described victim than to a statistically characterized group facing equivalent hardship is one of the most robust findings in moral psychology and behavioural economics. As large language models (LLMs) assume consequential roles in humanitarian triage, automated grant evaluation, and content moderation, a critical question arises: do these systems inherit the affective irrationalities present in human moral reasoning? We present the first systematic, large-scale empirical investigation of the IVE in LLMs, comprising N=51,955 validated API trials across 16 frontier models spanning nine organizational lineages (Google, Anthropic, OpenAI, Meta, DeepSeek, xAI, Alibaba, IBM, and Moonshot). Using a suite of ten experiments porting and extending canonical paradigms from Small…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
