KID: Knowledge-Injected Dual-Head Learning for Knowledge-Grounded Harmful Meme Detection
Yaocong Li, Leihan Zhang, Le Zhang, Qiang Yan

TL;DR
KID introduces a knowledge-grounded dual-head learning framework that enhances harmful meme detection by explicitly integrating background knowledge and structured reasoning, achieving state-of-the-art results across multiple languages.
Contribution
The paper presents a novel framework combining knowledge injection and dual-head learning for improved understanding of harmful memes, addressing limitations of prior intra-modal analysis methods.
Findings
Achieves state-of-the-art performance on multilingual datasets
Improves detection accuracy by 2.1% to 19.7% over previous methods
Validates effectiveness of knowledge injection and dual-head architecture
Abstract
Internet memes have become pervasive carriers of digital culture on social platforms. However, their heavy reliance on metaphors and sociocultural context also makes them subtle vehicles for harmful content, posing significant challenges for automated content moderation. Existing approaches primarily focus on intra-modal and inter-modal signal analysis, while the understanding of implicit toxicity often depends on background knowledge that is not explicitly present in the meme itself. To address this challenge, we propose KID, a Knowledge-Injected Dual-Head Learning framework for knowledge-grounded harmful meme detection. KID adopts a label-constrained distillation paradigm to decompose complex meme understanding into structured reasoning chains that explicitly link visual evidence, background knowledge, and classification labels. These chains guide the learning process by grounding…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining
