Implicit Causality-biases in humans and LLMs as a tool for benchmarking LLM discourse capabilities

Florian Kankowski; Torgrim Solstad; Sina Zarriess; Oliver Bott

arXiv:2501.12980·cs.CL·January 23, 2025

TL;DR

This study compares the discourse biases of human participants with those of mono- and multilingual LLMs for Implicit Causality verbs, and works toward a benchmark that evaluates LLMs' discourse understanding through coreference, coherence, and referring-expression biases.

Contribution

It introduces a benchmark for assessing LLM discourse capabilities using well-established human discourse biases as a proxy.

Findings

01 Only the largest monolingual LLM (German Bloom 6.4B) shows more human-like coreference biases

02 No LLM displayed the typical human coherence explanation bias

03 All LLMs preferred subject arguments with simpler referring expressions

Abstract

In this paper, we compare data generated with mono- and multilingual LLMs spanning a range of model sizes with data provided by human participants in an experimental setting investigating well-established discourse biases. Beyond the comparison as such, we aim to develop a benchmark to assess the capabilities of LLMs with discourse biases as a robust proxy for more general discourse understanding capabilities. More specifically, we investigated Implicit Causality verbs, for which psycholinguistic research has found participants to display biases with regard to three phenomena: the establishment of (i) coreference relations (Experiment 1), (ii) coherence relations (Experiment 2), and (iii) the use of particular referring expressions (Experiments 3 and 4). With regard to coreference biases we found only the largest monolingual LLM (German Bloom 6.4B) to display more human-like biases.…

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling

Methods: BLOOM