Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Maggie Mi; Aline Villavicencio; Nafise Sadat Moosavi

arXiv:2410.16069·cs.CL·July 17, 2025

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context

Maggie Mi, Aline Villavicencio, Nafise Sadat Moosavi

PDF

Open Access 1 Datasets 1 Video

TL;DR

This paper investigates the ability of large language models to understand idiomatic expressions in context, revealing their limitations in disambiguating idiomatic meaning despite high performance on standard tasks.

Contribution

The authors create a novel contrastive dataset to specifically test LLMs' contextual understanding of idioms and analyze factors like frequency and sentence probability affecting performance.

Findings

01

LLMs often fail to resolve idiomaticity when context is necessary.

02

Model performance improves on sentences with higher likelihood.

03

Collocational frequency influences model accuracy.

Abstract

Human processing of idioms relies on understanding the contextual sentences in which idioms occur, as well as language-intrinsic features such as frequency and speaker-intrinsic factors like familiarity. While LLMs have shown high performance on idiomaticity detection tasks, this success may be attributed to reasoning shortcuts in existing datasets. To this end, we construct a novel, controlled contrastive dataset designed to test whether LLMs can effectively use context to disambiguate idiomatic meaning. Additionally, we explore how collocational frequency and sentence probability influence model performance. Our findings reveal that LLMs often fail to resolve idiomaticity when it is required to attend to the surrounding context, and that models perform better on sentences that have higher likelihood. The collocational frequency of expressions also impacts performance. We make our code…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

mmi01/DICE
dataset· 23 dl
23 dl

Videos

Rolling the DICE on Idiomaticity: How LLMs Fail to Grasp Context· underline

Taxonomy

TopicsNatural Language Processing Techniques · Library Science and Information Systems · Translation Studies and Practices