Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection
Huixuan Zhang, Xiaojun Wan

TL;DR
This paper introduces a new multimodal dataset from Weibo for hyperbole detection, evaluates various models on it, and analyzes cross-domain performance, advancing understanding of multimodal exaggeration detection.
Contribution
It presents the first multimodal hyperbole detection dataset and benchmarks, exploring the roles of text and images and assessing model performance across topics.
Findings
Pre-trained multimodal encoders show varying effectiveness.
Cross-domain performance varies significantly among models.
The dataset serves as a benchmark for future research.
Abstract
Hyperbole, or exaggeration, is a common linguistic phenomenon. The detection of hyperbole is an important part of understanding human expression. There have been several studies on hyperbole detection, but most of which focus on text modality only. However, with the development of social media, people can create hyperbolic expressions with various modalities, including text, images, videos, etc. In this paper, we focus on multimodal hyperbole detection. We create a multimodal detection dataset from Weibo (a Chinese social media) and carry out some studies on it. We treat the text and image from a piece of weibo as two modalities and explore the role of text and image for hyperbole detection. Different pre-trained multimodal encoders are also evaluated on this downstream task to show their performance. Besides, since this dataset is constructed from five different topics, we also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Law · Handwritten Text Recognition Techniques
MethodsFocus
