VModA: An Effective Framework for Adaptive NSFW Image Moderation

Han Bao; Qinying Wang; Zhi Chen; Qingming Li; Xuhong Zhang; Changjiang Li; Zonghui Wang; Shouling Ji; Wenzhi Chen

arXiv:2505.23386 · cs.CV · May 30, 2025

TL;DR

VModA is a general framework that improves the accuracy and adaptability of NSFW image detection across content categories and moderation rules, addressing the limitations of current detection methods.

Contribution

The paper introduces VModA, a novel adaptive framework that enhances NSFW detection accuracy and handles complex semantics and varying moderation rules.

Findings

1. Achieves up to a 54.3% accuracy improvement over existing methods.

2. Demonstrates strong adaptability across categories, scenarios, and models.

3. Re-annotates and corrects datasets, improving benchmark quality.

Abstract

Not Safe/Suitable for Work (NSFW) content is rampant on social networks and poses serious harm to citizens, especially minors. Current detection methods mainly rely on deep learning-based image recognition and classification. However, NSFW images are now presented in increasingly sophisticated ways, often using image details and complex semantics to obscure their true nature or attract more views. Although still understandable to humans, these images often evade existing detection methods, posing a significant threat. Further complicating the issue, varying regulations across platforms and regions create additional challenges for effective moderation, leading to detection bias and reduced accuracy. To address this, we propose VModA, a general and effective framework that adapts to diverse moderation rules and handles complex, semantically rich NSFW content across categories.…

Taxonomy

Topics: Hate Speech and Cyberbullying Detection · Adversarial Robustness in Machine Learning · Spam and Phishing Detection

Methods: Balanced Selection