ICM-Assistant: Instruction-tuning Multimodal Large Language Models for   Rule-based Explainable Image Content Moderation

Mengyang Wu; Yuzhi Zhao; Jialun Cao; Mingjie Xu; Zhongming Jiang,; Xuehui Wang; Qinbin Li; Guangneng Hu; Shengchao Qin; Chi-Wing Fu

arXiv:2412.18216·cs.CV·January 22, 2025

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation

Mengyang Wu, Yuzhi Zhao, Jialun Cao, Mingjie Xu, Zhongming Jiang,, Xuehui Wang, Qinbin Li, Guangneng Hu, Shengchao Qin, Chi-Wing Fu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces ICM-Assistant, a rule-based multimodal large language model designed for explainable and accurate image content moderation, outperforming existing models in classification and explanation quality.

Contribution

It presents a novel rule-based dataset generation pipeline and a specialized model that enhances flexibility, explainability, and accuracy in image content moderation tasks.

Findings

01

Significant improvement in moderation classification accuracy (36.8%)

02

Enhanced explanation quality (26.6%) over existing models

03

Effective application of rule-based prompts in multimodal models

Abstract

Controversial contents largely inundate the Internet, infringing various cultural norms and child protection standards. Traditional Image Content Moderation (ICM) models fall short in producing precise moderation decisions for diverse standards, while recent multimodal large language models (MLLMs), when adopted to general rule-based ICM, often produce classification and explanation results that are inconsistent with human moderators. Aiming at flexible, explainable, and accurate ICM, we design a novel rule-based dataset generation pipeline, decomposing concise human-defined rules and leveraging well-designed multi-stage prompts to enrich short explicit image annotations. Our ICM-Instruct dataset includes detailed moderation explanation and moderation Q-A pairs. Built upon it, we create our ICM-Assistant model in the framework of rule-based ICM, making it readily applicable in real…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhaoyuzhi/icm-assistant
pytorchOfficial

Videos

ICM-Assistant: Instruction-tuning Multimodal Large Language Models for Rule-based Explainable Image Content Moderation· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis