SarcasmBench: Towards Evaluating Large Language Models on Sarcasm   Understanding

Yazhou Zhang; Chunwang Zou; Zheng Lian; Prayag Tiwari; Jing Qin

arXiv:2408.11319·cs.CL·August 27, 2024·3 cites

SarcasmBench: Towards Evaluating Large Language Models on Sarcasm Understanding

Yazhou Zhang, Chunwang Zou, Zheng Lian, Prayag Tiwari, Jing Qin

PDF

Open Access

TL;DR

This paper evaluates large language models' ability to understand sarcasm, revealing current models underperform compared to supervised baselines, with GPT-4 leading in effectiveness across various prompting methods.

Contribution

It provides a comprehensive benchmarking of LLMs on sarcasm detection, highlighting the limitations of current models and the effectiveness of different prompting strategies.

Findings

01

GPT-4 outperforms other LLMs in sarcasm detection.

02

Few-shot IO prompting is more effective than zero-shot IO and CoT.

03

Current LLMs underperform supervised sarcasm detection models.

Abstract

In the era of large language models (LLMs), the task of ``System I''~-~the fast, unconscious, and intuitive tasks, e.g., sentiment analysis, text classification, etc., have been argued to be successfully solved. However, sarcasm, as a subtle linguistic phenomenon, often employs rhetorical devices like hyperbole and figuration to convey true sentiments and intentions, involving a higher level of abstraction than sentiment analysis. There is growing concern that the argument about LLMs' success may not be fully tenable when considering sarcasm understanding. To address this question, we select eleven SOTA LLMs and eight SOTA pre-trained language models (PLMs) and present comprehensive evaluations on six widely used benchmark datasets through different prompting approaches, i.e., zero-shot input/output (IO) prompting, few-shot IO prompting, chain of thought (CoT) prompting. Our results…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsWildlife Ecology and Conservation

MethodsAttention Is All You Need · Linear Layer · Residual Connection · Multi-Head Attention · Adam · Layer Normalization · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Absolute Position Encodings