Towards Low-Resource Harmful Meme Detection with LMM Agents
Jianzhao Huang, Hongzhan Lin, Ziyan Liu, Ziyang Luo, Guang Chen, Jing, Ma

TL;DR
This paper introduces an agency-driven framework utilizing Large Multimodal Models for effective harmful meme detection in low-resource scenarios, leveraging few-shot learning and multimodal reasoning to improve accuracy.
Contribution
It proposes a novel LMM-based approach that employs retrieval and knowledge revision strategies for low-resource harmful meme detection, enhancing generalization and reasoning capabilities.
Findings
Outperforms state-of-the-art methods on three meme datasets.
Effective in low-resource, few-shot scenarios.
Demonstrates strong multimodal reasoning for harm detection.
Abstract
The proliferation of Internet memes in the age of social media necessitates effective identification of harmful ones. Due to the dynamic nature of memes, existing data-driven models may struggle in low-resource scenarios where only a few labeled examples are available. In this paper, we propose an agency-driven framework for low-resource harmful meme detection, employing both outward and inward analysis with few-shot annotated samples. Inspired by the powerful capacity of Large Multimodal Models (LMMs) on multimodal reasoning, we first retrieve relative memes with annotations to leverage label information as auxiliary signals for the LMM agent. Then, we elicit knowledge-revising behavior within the LMM agent to derive well-generalized insights into meme harmfulness. By combining these strategies, our approach enables dialectical reasoning over intricate and implicit harm-indicative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Malware Detection Techniques · Network Security and Intrusion Detection · Spam and Phishing Detection
