BLM-Guard: Explainable Multimodal Ad Moderation with Chain-of-Thought and Policy-Aligned Rewards

Yiran Yang; Zhaowei Liu; Yuan Yuan; Yukun Song; Xiong Ma; Yinghao Song; Xiangji Zeng; Lu Sun; Yulu Wang; Hai Zhou; Shuai Cui; Zhaohan Gong; and Jiefei Zhang

arXiv:2602.18193 · cs.CV · February 24, 2026

TL;DR

BLM-Guard is an explainable multimodal ad moderation framework that combines Chain-of-Thought reasoning, rule-based policy guidance, and reinforcement learning to improve detection accuracy and robustness in short-video ads.

Contribution

It introduces a novel rule-driven data synthesis pipeline and a reinforcement learning approach for policy-aligned, explainable multimodal ad moderation.

Findings

01. Outperforms strong baselines in accuracy and consistency

02. Enhances robustness to intra-modal manipulations and cross-modal mismatches

03. Demonstrates effective generalization on real-world short-video ads

Abstract

Short-video platforms now host vast multimodal ads whose deceptive visuals, speech and subtitles demand finer-grained, policy-driven moderation than community safety filters. We present BLM-Guard, a content-audit framework for commercial ads that fuses Chain-of-Thought reasoning with rule-based policy principles and a critic-guided reward. A rule-driven ICoT data-synthesis pipeline jump-starts training by generating structured scene descriptions, reasoning chains and labels, cutting annotation costs. Reinforcement learning then refines the model using a composite reward balancing causal coherence with policy adherence. A multitask architecture models intra-modal manipulations (e.g., exaggerated imagery) and cross-modal mismatches (e.g., subtitle-speech drift), boosting robustness. Experiments on real short-video ads show BLM-Guard surpasses strong baselines in accuracy, consistency and…

Taxonomy

Topics: Multimodal Machine Learning Applications · Generative Adversarial Networks and Image Synthesis · Emotion and Mood Recognition