CogMorph: Cognitive Morphing Attacks for Text-to-Image Models
Zonglei Jing, Zonghao Ying, Le Wang, Siyuan Liang, Aishan Liu,, Xianglong Liu, Dacheng Tao

TL;DR
This paper introduces CogMorph, a novel attack method that manipulates text-to-image models to generate images with harmful contextual elements, revealing a new ethical risk in AI-generated imagery.
Contribution
The paper presents a new cognitive morphing attack technique, including a toxicity taxonomy and hierarchical prompt manipulation, to embed harmful content in generated images.
Findings
CogMorph significantly outperforms baselines (+20.62% accuracy)
Effective in both open-source and commercial T2I models
Reveals ethical risks in current text-to-image generation
Abstract
The development of text-to-image (T2I) generative models, that enable the creation of high-quality synthetic images from textual prompts, has opened new frontiers in creative design and content generation. However, this paper reveals a significant and previously unrecognized ethical risk inherent in this technology and introduces a novel method, termed the Cognitive Morphing Attack (CogMorph), which manipulates T2I models to generate images that retain the original core subjects but embeds toxic or harmful contextual elements. This nuanced manipulation exploits the cognitive principle that human perception of concepts is shaped by the entire visual scene and its context, producing images that amplify emotional harm far beyond attacks that merely preserve the original semantics. To address this, we first construct an imagery toxicity taxonomy spanning 10 major and 48 sub-categories,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications
MethodsBalanced Selection
