KID: Knowledge-Injected Dual-Head Learning for Knowledge-Grounded Harmful Meme Detection

Yaocong Li; Leihan Zhang; Le Zhang; Qiang Yan

arXiv:2601.21796·cs.CL·January 30, 2026

KID: Knowledge-Injected Dual-Head Learning for Knowledge-Grounded Harmful Meme Detection

Yaocong Li, Leihan Zhang, Le Zhang, Qiang Yan

PDF

Open Access

TL;DR

KID introduces a knowledge-grounded dual-head learning framework that enhances harmful meme detection by explicitly integrating background knowledge and structured reasoning, achieving state-of-the-art results across multiple languages.

Contribution

The paper presents a novel framework combining knowledge injection and dual-head learning for improved understanding of harmful memes, addressing limitations of prior intra-modal analysis methods.

Findings

01

Achieves state-of-the-art performance on multilingual datasets

02

Improves detection accuracy by 2.1% to 19.7% over previous methods

03

Validates effectiveness of knowledge injection and dual-head architecture

Abstract

Internet memes have become pervasive carriers of digital culture on social platforms. However, their heavy reliance on metaphors and sociocultural context also makes them subtle vehicles for harmful content, posing significant challenges for automated content moderation. Existing approaches primarily focus on intra-modal and inter-modal signal analysis, while the understanding of implicit toxicity often depends on background knowledge that is not explicitly present in the meme itself. To address this challenge, we propose KID, a Knowledge-Injected Dual-Head Learning framework for knowledge-grounded harmful meme detection. KID adopts a label-constrained distillation paradigm to decompose complex meme understanding into structured reasoning chains that explicitly link visual evidence, background knowledge, and classification labels. These chains guide the learning process by grounding…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHate Speech and Cyberbullying Detection · Misinformation and Its Impacts · Sentiment Analysis and Opinion Mining