Moderating Illicit Online Image Promotion for Unsafe User-Generated   Content Games Using Large Vision-Language Models

Keyan Guo; Ayush Utkarsh; Wenbo Ding; Isabelle Ondracek; Ziming Zhao,; Guo Freeman; Nishant Vishwamitra; Hongxin Hu

arXiv:2403.18957·cs.CY·August 13, 2024·2 cites

Moderating Illicit Online Image Promotion for Unsafe User-Generated Content Games Using Large Vision-Language Models

Keyan Guo, Ayush Utkarsh, Wenbo Ding, Isabelle Ondracek, Ziming Zhao,, Guo Freeman, Nishant Vishwamitra, Hongxin Hu

PDF

Open Access 2 Repos

TL;DR

This paper introduces UGCG-Guard, a novel system using large vision-language models and zero-shot domain adaptation to effectively detect illicit promotional images for unsafe user-generated content games, achieving 94% accuracy.

Contribution

The study presents the first dataset of illicit UGCG promotion images and develops UGCG-Guard, a system that leverages large vision-language models with a novel prompting strategy for zero-shot detection.

Findings

01

UGCG-Guard achieves 94% detection accuracy.

02

A new dataset of 2,924 illicit promotional images is introduced.

03

The approach demonstrates effective zero-shot domain adaptation.

Abstract

Online user generated content games (UGCGs) are increasingly popular among children and adolescents for social interaction and more creative online entertainment. However, they pose a heightened risk of exposure to explicit content, raising growing concerns for the online safety of children and adolescents. Despite these concerns, few studies have addressed the issue of illicit image-based promotions of unsafe UGCGs on social media, which can inadvertently attract young users. This challenge arises from the difficulty of obtaining comprehensive training data for UGCG images and the unique nature of these images, which differ from traditional unsafe content. In this work, we take the first step towards studying the threat of illicit promotions of unsafe UGCGs. We collect a real-world dataset comprising 2,924 images that display diverse sexually explicit and violent content used to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Malware Detection Techniques · Spam and Phishing Detection · Digital Games and Media