Seeing Is No Longer Believing: Frontier Image Generation Models, Synthetic Visual Evidence, and Real-World Risk
Shuai Wu, Xue Li, Yanna Feng, Yufang Li, Zhijun Wang, and Ran Wang

TL;DR
This paper analyzes the rise of advanced synthetic image models, their societal risks, and proposes layered controls and policies to mitigate potential harms across various sectors.
Contribution
It provides a comprehensive technical and policy analysis of synthetic visual risks, introducing a risk framework and practical recommendations for mitigation.
Findings
Risk is driven by realism, text legibility, identity persistence, and distribution context.
Synthetic images pose threats in finance, medicine, news, law, and civic discourse.
Layered control strategies can mitigate societal harms.
Abstract
Frontier image generation has moved from artistic synthesis toward synthetic visual evidence. Systems such as GPT Image 2, Nano Banana Pro, Nano Banana 2, Grok Imagine, Qwen Image 2.0 Pro, and Seedream 5.0 Lite combine photorealistic rendering, readable typography, reference consistency, editing control, and in several cases reasoning or search-grounded image construction. These capabilities create large benefits for design, education, accessibility, and communication, yet they also weaken one of society's most common trust shortcuts: the belief that a plausible picture is a reliable record. This paper provides a source-grounded technical and policy analysis of synthetic visual risk. We first summarize the public capabilities of recent image models, then analyze public incidents involving fake crisis images, celebrity and public-figure imagery, medical scans, forged-looking documents,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
