Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification

Wenhao Qian; Zhenzhen Hu; Zijie Song; Jia Li

arXiv:2505.11237·cs.MM·April 20, 2026

Concept Drift Guided LayerNorm Tuning for Efficient Multimodal Metaphor Identification

Wenhao Qian, Zhenzhen Hu, Zijie Song, Jia Li

PDF

1 Repo

TL;DR

This paper presents CDGLT, a training-efficient framework for multimodal metaphor identification that leverages concept drift and prompt construction to improve accuracy and reduce costs.

Contribution

Introduction of Concept Drift Guided LayerNorm Tuning (CDGLT), a novel method combining concept drift and prompt strategies for efficient multimodal metaphor detection.

Findings

01

Achieves state-of-the-art results on MET-Meme benchmark.

02

Reduces training costs significantly compared to generative methods.

03

Effective ablation results confirm the importance of both innovations.

Abstract

Metaphorical imagination, the ability to connect seemingly unrelated concepts, is fundamental to human cognition and communication. While understanding linguistic metaphors has advanced significantly, grasping multimodal metaphors, such as those found in internet memes, presents unique challenges due to their unconventional expressions and implied meanings. Existing methods for multimodal metaphor identification often struggle to bridge the gap between literal and figurative interpretations. Additionally, generative approaches that utilize large language models or text-to-image models, while promising, suffer from high computational costs. This paper introduces \textbf{C}oncept \textbf{D}rift \textbf{G}uided \textbf{L}ayerNorm \textbf{T}uning (\textbf{CDGLT}), a novel and training-efficient framework for multimodal metaphor identification. CDGLT incorporates two key innovations: (1)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Qianvenh/CDGLT
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.