MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection

Ziyan Liu; Chunxiao Fan; Haoran Lou; Yuexin Wu; Kaiwei Deng

arXiv:2507.06908·cs.CL·July 10, 2025

MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection

Ziyan Liu, Chunxiao Fan, Haoran Lou, Yuexin Wu, Kaiwei Deng

PDF

Open Access 1 Video

TL;DR

MIND is a novel multi-agent framework that enables zero-shot harmful meme detection by leveraging retrieval, insight derivation, and debate mechanisms, outperforming existing methods without relying on annotated data.

Contribution

The paper introduces a new multi-agent framework for zero-shot harmful meme detection that does not depend on annotated data and demonstrates strong generalization capabilities.

Findings

01

Outperforms existing zero-shot meme detection methods

02

Shows strong generalization across different models and scales

03

Effective in detecting harmful memes without annotated training data

Abstract

The rapid expansion of memes on social media has highlighted the urgent need for effective approaches to detect harmful content. However, traditional data-driven approaches struggle to detect new memes due to their evolving nature and the lack of up-to-date annotated data. To address this issue, we propose MIND, a multi-agent framework for zero-shot harmful meme detection that does not rely on annotated data. MIND implements three key strategies: 1) We retrieve similar memes from an unannotated reference set to provide contextual information. 2) We propose a bi-directional insight derivation mechanism to extract a comprehensive understanding of similar memes. 3) We then employ a multi-agent debate mechanism to ensure robust decision-making through reasoned arbitration. Extensive experiments on three meme datasets demonstrate that our proposed framework not only outperforms existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection· underline

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining