When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

Ye Leng; Junjie Chu; Mingjie Li; Chenhao Lin; Chao Shen; Michael Backes; Yun Shen; Yang Zhang

arXiv:2603.24079·cs.CV·March 26, 2026

When Understanding Becomes a Risk: Authenticity and Safety Risks in the Emerging Image Generation Paradigm

Ye Leng, Junjie Chu, Mingjie Li, Chenhao Lin, Chao Shen, Michael Backes, Yun Shen, Yang Zhang

PDF

Open Access

TL;DR

This paper analyzes the safety risks of multimodal large language models (MLLMs), revealing they generate more unsafe and harder-to-detect images than diffusion models, raising concerns about their real-world safety implications.

Contribution

It systematically compares safety risks of MLLMs and diffusion models, highlighting their increased unsafe content generation and challenges in detection.

Findings

01

MLLMs produce more unsafe images than diffusion models.

02

MLLM-generated images are harder to detect even with retrained detectors.

03

Enhanced semantic understanding in MLLMs leads to greater safety risks.

Abstract

Recently, multimodal large language models (MLLMs) have emerged as a unified paradigm for language and image generation. Compared with diffusion models, MLLMs possess a much stronger capability for semantic understanding, enabling them to process more complex textual inputs and comprehend richer contextual meanings. However, this enhanced semantic ability may also introduce new and potentially greater safety risks. Taking diffusion models as a reference point, we systematically analyze and compare the safety risks of emerging MLLMs along two dimensions: unsafe content generation and fake image synthesis. Across multiple unsafe generation benchmark datasets, we observe that MLLMs tend to generate more unsafe images than diffusion models. This difference partly arises because diffusion models often fail to interpret abstract prompts, producing corrupted outputs, whereas MLLMs can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Misinformation and Its Impacts · Topic Modeling