CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Zonglei Jing; Zonghao Ying; Le Wang; Siyuan Liang; Aishan Liu,; Xianglong Liu; Dacheng Tao

arXiv:2501.11815·cs.CV·January 23, 2025

CogMorph: Cognitive Morphing Attacks for Text-to-Image Models

Zonglei Jing, Zonghao Ying, Le Wang, Siyuan Liang, Aishan Liu,, Xianglong Liu, Dacheng Tao

PDF

Open Access

TL;DR

This paper introduces CogMorph, a novel attack method that manipulates text-to-image models to generate images with harmful contextual elements, revealing a new ethical risk in AI-generated imagery.

Contribution

The paper presents a new cognitive morphing attack technique, including a toxicity taxonomy and hierarchical prompt manipulation, to embed harmful content in generated images.

Findings

01

CogMorph significantly outperforms baselines (+20.62% accuracy)

02

Effective in both open-source and commercial T2I models

03

Reveals ethical risks in current text-to-image generation

Abstract

The development of text-to-image (T2I) generative models, that enable the creation of high-quality synthetic images from textual prompts, has opened new frontiers in creative design and content generation. However, this paper reveals a significant and previously unrecognized ethical risk inherent in this technology and introduces a novel method, termed the Cognitive Morphing Attack (CogMorph), which manipulates T2I models to generate images that retain the original core subjects but embeds toxic or harmful contextual elements. This nuanced manipulation exploits the cognitive principle that human perception of concepts is shaped by the entire visual scene and its context, producing images that amplify emotional harm far beyond attacks that merely preserve the original semantics. To address this, we first construct an imagery toxicity taxonomy spanning 10 major and 48 sub-categories,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Face recognition and analysis · Multimodal Machine Learning Applications

MethodsBalanced Selection