MIND: A Multi-agent Framework for Zero-shot Harmful Meme Detection
Ziyan Liu, Chunxiao Fan, Haoran Lou, Yuexin Wu, Kaiwei Deng

TL;DR
MIND is a novel multi-agent framework that enables zero-shot harmful meme detection by leveraging retrieval, insight derivation, and debate mechanisms, outperforming existing methods without relying on annotated data.
Contribution
The paper introduces a new multi-agent framework for zero-shot harmful meme detection that does not depend on annotated data and demonstrates strong generalization capabilities.
Findings
Outperforms existing zero-shot meme detection methods
Shows strong generalization across different models and scales
Effective in detecting harmful memes without annotated training data
Abstract
The rapid expansion of memes on social media has highlighted the urgent need for effective approaches to detect harmful content. However, traditional data-driven approaches struggle to detect new memes due to their evolving nature and the lack of up-to-date annotated data. To address this issue, we propose MIND, a multi-agent framework for zero-shot harmful meme detection that does not rely on annotated data. MIND implements three key strategies: 1) We retrieve similar memes from an unannotated reference set to provide contextual information. 2) We propose a bi-directional insight derivation mechanism to extract a comprehensive understanding of similar memes. 3) We then employ a multi-agent debate mechanism to ensure robust decision-making through reasoned arbitration. Extensive experiments on three meme datasets demonstrate that our proposed framework not only outperforms existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining
