Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models
Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao,, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

TL;DR
This paper introduces UGCG-Guard, a novel system using large vision-language models and zero-shot domain adaptation to effectively detect illicit promotional images for unsafe user-generated content games, achieving 94% accuracy.
Contribution
The study presents the first dataset of illicit UGCG promotion images and develops UGCG-Guard, a system that leverages large vision-language models with a novel prompting strategy for zero-shot detection.
Findings
UGCG-Guard achieves 94% detection accuracy.
A new dataset of 2,924 illicit promotional images is introduced.
The approach demonstrates effective zero-shot domain adaptation.
Abstract
Online user generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment. However, they pose a heightened risk of exposure to explicit content, raising growing concerns for the online safety of children and adolescents. Despite these concerns, few studies have addressed the issue of illicit image-based promotions of unsafe UGCGs on social media, which can inadvertently attract young users. This challenge arises from the difficulty of obtaining comprehensive training data for UGCG images and the unique nature of these images, which differ from traditional unsafe content. In this work, we take the first step towards studying the threat of illicit promotions of unsafe UGCGs. We collect a real-world dataset comprising 2,924 images that display diverse sexually explicit and violent content used to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Malware Detection Techniques · Spam and Phishing Detection · Digital Games and Media
