Detecting Corporate AI-Washing via Cross-Modal Semantic Inconsistency Learning
Zhanjie Wen, Jingqiao Guo

TL;DR
This paper introduces AWASH, a multimodal framework for detecting corporate AI-washing by reasoning across text, images, and videos, outperforming existing methods and aiding regulatory analysis.
Contribution
It presents the first large-scale trimodal benchmark and a novel cross-modal inconsistency detection network for AI-washing detection.
Findings
CMID achieves an F1 score of 0.882 and AUC-ROC of 0.921.
The framework outperforms text-only and multimodal baselines significantly.
User study shows 43% reduction in review time and 28% increase in true positive detection.
Abstract
Corporate AI-washing-the strategic misrepresentation of AI capabilities via exaggerated or fabricated cross-channel disclosures-has emerged as a systemic threat to capital market information integrity with the widespread adoption of generative AI. Existing detection methods rely on single-modal text frequency analysis, suffering from vulnerability to adversarial reformulation and cross-channel obfuscation. This paper presents AWASH, a multimodal framework that redefines AI-washing detection as cross-modal claim-evidence reasoning (instead of surface-level similarity measurement), built on AW-Bench-the first large-scale trimodal benchmark for this task, including 88412 aligned annual report text, disclosure image, and earnings call video triplets from 4892 A-share listed firms during 2019Q1-2025Q2. We propose the Cross-Modal Inconsistency Detection (CMID) network, integrating a tri-modal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
