Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding   Dataset from Ruo Zhi Ba

Ruiqi He; Yushu He; Longju Bai; Jiarui Liu; Zhenjie Sun; Zenghao Tang,; He Wang; Hanchen Xia; Naihao Deng

arXiv:2406.12754·cs.CL·June 19, 2024·1 cites

Chumor 1.0: A Truly Funny and Challenging Chinese Humor Understanding Dataset from Ruo Zhi Ba

Ruiqi He, Yushu He, Longju Bai, Jiarui Liu, Zhenjie Sun, Zenghao Tang,, He Wang, Hanchen Xia, Naihao Deng

PDF

Open Access 1 Datasets

TL;DR

Chumor 1.0 is a novel Chinese humor dataset from Ruo Zhi Ba that challenges state-of-the-art language models and highlights the cultural nuances in humor understanding.

Contribution

This work introduces Chumor, the first Chinese humor dataset with explanations, and evaluates LLMs' ability to understand culturally specific humor.

Findings

01

Chumor is challenging for SOTA LLMs.

02

Human explanations outperform LLM-generated explanations.

03

The dataset captures culturally nuanced Chinese humor.

Abstract

Existing humor datasets and evaluations predominantly focus on English, lacking resources for culturally nuanced humor in non-English languages like Chinese. To address this gap, we construct Chumor, a dataset sourced from Ruo Zhi Ba (RZB), a Chinese Reddit-like platform dedicated to sharing intellectually challenging and culturally specific jokes. We annotate explanations for each joke and evaluate human explanations against two state-of-the-art LLMs, GPT-4o and ERNIE Bot, through A/B testing by native Chinese speakers. Our evaluation shows that Chumor is challenging even for SOTA LLMs, and the human explanations for Chumor jokes are significantly better than explanations generated by the LLMs.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

MichiganNLP/Chumor
dataset· 36 dl
36 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHumor Studies and Applications

MethodsFocus · ERNIE