TL;DR
This paper presents CDGLT, a training-efficient framework for multimodal metaphor identification that leverages concept drift and prompt construction to improve accuracy and reduce costs.
Contribution
Introduction of Concept Drift Guided LayerNorm Tuning (CDGLT), a novel method combining concept drift and prompt strategies for efficient multimodal metaphor detection.
Findings
Achieves state-of-the-art results on MET-Meme benchmark.
Reduces training costs significantly compared to generative methods.
Effective ablation results confirm the importance of both innovations.
Abstract
Metaphorical imagination, the ability to connect seemingly unrelated concepts, is fundamental to human cognition and communication. While understanding linguistic metaphors has advanced significantly, grasping multimodal metaphors, such as those found in internet memes, presents unique challenges due to their unconventional expressions and implied meanings. Existing methods for multimodal metaphor identification often struggle to bridge the gap between literal and figurative interpretations. Additionally, generative approaches that utilize large language models or text-to-image models, while promising, suffer from high computational costs. This paper introduces \textbf{C}oncept \textbf{D}rift \textbf{G}uided \textbf{L}ayerNorm \textbf{T}uning (\textbf{CDGLT}), a novel and training-efficient framework for multimodal metaphor identification. CDGLT incorporates two key innovations: (1)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
