TL;DR
This paper introduces CHASM, a novel dataset for evaluating multimodal large language models' ability to detect covert advertisements on Chinese social media, revealing current models' limitations and the need for improved detection methods.
Contribution
The creation of CHASM, a high-quality, real-world dataset for assessing MLLMs in identifying covert ads, and an analysis of current models' performance and challenges.
Findings
Current MLLMs perform poorly in detecting covert advertisements.
Fine-tuning improves detection but challenges remain in subtle cues.
Detecting comments and visual-text discrepancies is particularly difficult.
Abstract
Current benchmarks for evaluating large language models (LLMs) in social media moderation completely overlook a serious threat: covert advertisements, which disguise themselves as regular posts to deceive and mislead consumers into making purchases, leading to significant ethical and legal concerns. In this paper, we present the CHASM, a first-of-its-kind dataset designed to evaluate the capability of Multimodal Large Language Models (MLLMs) in detecting covert advertisements on social media. CHASM is a high-quality, anonymized, manually curated dataset consisting of 4,992 instances, based on real-world scenarios from the Chinese social media platform Rednote. The dataset was collected and annotated under strict privacy protection and quality control protocols. It includes many product experience sharing posts that closely resemble covert advertisements, making the dataset particularly…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
