Evaluating Large Language Models' Ability Using a Psychiatric Screening   Tool Based on Metaphor and Sarcasm Scenarios

Hiromu Yakura

arXiv:2309.10744·cs.CL·July 23, 2024·1 cites

Evaluating Large Language Models' Ability Using a Psychiatric Screening Tool Based on Metaphor and Sarcasm Scenarios

Hiromu Yakura

PDF

Open Access 1 Repo

TL;DR

This paper assesses large language models' understanding of metaphor and sarcasm using a psychiatric screening tool, revealing improved metaphor comprehension with larger models but no similar gains for sarcasm, highlighting the need for specialized training.

Contribution

It introduces a novel evaluation of LLMs' nuanced social communication skills using a standardized psychiatric screening test.

Findings

01

Larger models show better metaphor understanding.

02

No significant improvement in sarcasm comprehension with increased model size.

03

Sarcasm comprehension may require emotionally grounded training strategies.

Abstract

Metaphors and sarcasm are precious fruits of our highly evolved social communication skills. However, children with the condition then known as Asperger syndrome are known to have difficulties in comprehending sarcasm, even if they possess adequate verbal IQs for understanding metaphors. Accordingly, researchers had employed a screening test that assesses metaphor and sarcasm comprehension to distinguish Asperger syndrome from other conditions with similar external behaviors (e.g., attention-deficit/hyperactivity disorder). This study employs a standardized test to evaluate recent large language models' (LLMs) understanding of nuanced human communication. The results indicate improved metaphor comprehension with increased model parameters; however, no similar improvement was observed for sarcasm comprehension. Considering that a human's ability to grasp sarcasm has been associated with…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hiromu/llm-msst
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsLanguage, Metaphor, and Cognition · Language Development and Disorders · Neurobiology of Language and Bilingualism