TL;DR
This paper introduces CONVEX, a large multimodal misinformation dataset, and analyzes how AI-generated content spreads, is detected, and impacts online discourse, emphasizing the need for adaptive detection strategies.
Contribution
The study provides a comprehensive dataset and analysis of multimodal misinformation dynamics, revealing challenges in detecting evolving AI-generated media.
Findings
AI-generated content achieves disproportionate virality
Passive engagement drives spread more than active discourse
Detection models' performance declines as generative models evolve
Abstract
As generative AI advances, the distinction between authentic and synthetic media is increasingly blurred, challenging the integrity of online information. In this study, we present CONVEX, a large-scale dataset of multimodal misinformation involving miscaptioned, edited, and AI-generated visual content, comprising over 150K multimodal posts with associated notes and engagement metrics from X's Community Notes. We analyze how multimodal misinformation evolves in terms of virality, engagement, and consensus dynamics, with a focus on synthetic media. Our results show that while AI-generated content achieves disproportionate virality, its spread is driven primarily by passive engagement rather than active discourse. Despite slower initial reporting, AI-generated content reaches community consensus more quickly once flagged. Moreover, our evaluation of specialized detectors and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
