Images Amplify Misinformation Sharing in Vision-Language Models
Alice Plebe, Timothy Douglas, Diana Riazi, R. Maria del Rio-Chanona

TL;DR
This study investigates how images influence vision-language models' tendency to reshare misinformation, revealing that images increase false news sharing and that model responses vary with persona traits and model types.
Contribution
First comprehensive analysis of visual influence on VLMs' misinformation sharing behavior, highlighting biases and proposing evaluation and mitigation considerations.
Findings
Images increase false news resharing by 14.5%.
Persona traits like Dark Triad amplify false news sharing.
Claude-3-Haiku shows robustness to visual misinformation.
Abstract
As language and vision-language models (VLMs) become central to information access and online interaction, concerns grow about their potential to amplify misinformation. Human studies show that images boost the perceived credibility and shareability of information, raising the question of whether VLMs exhibit the same vulnerability. We present the first study examining how images influence VLMs' propensity to reshare news content, how this effect varies across model families, and how persona conditioning and content attributes modulate such behavior. We develop a jailbreaking-inspired prompting strategy that bypasses VLMs' default refusals to engage with controversial news, allowing them to generate resharing decisions across diverse topics and elicited traits, including antisocial ones. We evaluate four state-of-the-art VLMs on a novel multimodal dataset of fact-checked political news…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
