LLMs as mediators: Can they diagnose conflicts accurately?

\"Ozgecan Ko\c{c}ak (Emory University); Phanish Puranam (INSEAD); Af\c{s}ar Yegin (Kadir Has University)

arXiv:2412.14675·cs.CL·November 24, 2025

LLMs as mediators: Can they diagnose conflicts accurately?

\"Ozgecan Ko\c{c}ak (Emory University), Phanish Puranam (INSEAD), Af\c{s}ar Yegin (Kadir Has University)

PDF

TL;DR

This study evaluates GPT 3.5 and GPT 4's ability to diagnose conflict sources, revealing they understand causal and moral distinctions but tend to misjudge their prevalence, with GPT 4 performing better.

Contribution

First empirical assessment of LLMs' capacity to identify causal versus moral disagreements in conflict mediation tasks.

Findings

01

Both LLMs understand causal and moral distinctions similarly to humans.

02

LLMs tend to overestimate causal disagreements and underestimate moral disagreements.

03

GPT 4 outperforms GPT 3.5 and humans in diagnosing conflict sources.

Abstract

Prior research indicates that to be able to mediate conflict, observers of disagreements between parties must be able to reliably distinguish the sources of their disagreement as stemming from differences in beliefs about what is true (causality) vs. differences in what they value (morality). In this paper, we test if OpenAI's Large Language Models GPT 3.5 and GPT 4 can perform this task and whether one or other type of disagreement proves particularly challenging for LLM's to diagnose. We replicate study 1 in Ko\c{c}ak et al. (2003), which employes a vignette design, with OpenAI's GPT 3.5 and GPT 4. We find that both LLMs have similar semantic understanding of the distinction between causal and moral codes as humans and can reliably distinguish between them. When asked to diagnose the source of disagreement in a conversation, both LLMs, compared to humans, exhibit a tendency to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsResidual Connection · Linear Layer · Weight Decay · Cosine Annealing · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Softmax · Attention Dropout · Attention Is All You Need · Dense Connections