Image Matters: A New Dataset and Empirical Study for Multimodal   Hyperbole Detection

Huixuan Zhang; Xiaojun Wan

arXiv:2307.00209·cs.CV·March 12, 2024

Image Matters: A New Dataset and Empirical Study for Multimodal Hyperbole Detection

Huixuan Zhang, Xiaojun Wan

PDF

Open Access 1 Repo

TL;DR

This paper introduces a new multimodal dataset from Weibo for hyperbole detection, evaluates various models on it, and analyzes cross-domain performance, advancing understanding of multimodal exaggeration detection.

Contribution

It presents the first multimodal hyperbole detection dataset and benchmarks, exploring the roles of text and images and assessing model performance across topics.

Findings

01

Pre-trained multimodal encoders show varying effectiveness.

02

Cross-domain performance varies significantly among models.

03

The dataset serves as a benchmark for future research.

Abstract

Hyperbole, or exaggeration, is a common linguistic phenomenon. The detection of hyperbole is an important part of understanding human expression. There have been several studies on hyperbole detection, but most of which focus on text modality only. However, with the development of social media, people can create hyperbolic expressions with various modalities, including text, images, videos, etc. In this paper, we focus on multimodal hyperbole detection. We create a multimodal detection dataset from Weibo (a Chinese social media) and carry out some studies on it. We treat the text and image from a piece of weibo as two modalities and explore the role of text and image for hyperbole detection. Different pre-trained multimodal encoders are also evaluated on this downstream task to show their performance. Besides, since this dataset is constructed from five different topics, we also…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lleozhang/multimodal_hyperbole
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Law · Handwritten Text Recognition Techniques

MethodsFocus