Negative Token Merging: Image-based Adversarial Feature Guidance

Jaskirat Singh; Lindsey Li; Weijia Shi; Ranjay Krishna; Yejin Choi,; Pang Wei Koh; Michael F. Cohen; Stephen Gould; Liang Zheng; Luke Zettlemoyer

arXiv:2412.01339·cs.CV·December 6, 2024

Negative Token Merging: Image-based Adversarial Feature Guidance

Jaskirat Singh, Lindsey Li, Weijia Shi, Ranjay Krishna, Yejin Choi,, Pang Wei Koh, Michael F. Cohen, Stephen Gould, Liang Zheng, Luke Zettlemoyer

PDF

Open Access

TL;DR

This paper introduces NegToMe, a training-free visual feature guidance method for diffusion models that enhances diversity and reduces copyrighted content similarity by selectively pushing apart features during image generation.

Contribution

NegToMe is the first approach to perform adversarial guidance directly with visual features, improving diversity and copyright protection without additional training.

Findings

01

NegToMe significantly increases output diversity.

02

Reduces visual similarity to copyrighted images by 34.57%.

03

Compatible with various diffusion architectures, including Flux.

Abstract

Text-based adversarial guidance using a negative prompt has emerged as a widely adopted approach to steer diffusion models away from producing undesired concepts. While useful, performing adversarial guidance using text alone can be insufficient to capture complex visual concepts or avoid specific visual elements like copyrighted characters. In this paper, for the first time we explore an alternate modality in this direction by performing adversarial guidance directly using visual features from a reference image or other images in a batch. We introduce negative token merging (NegToMe), a simple but effective training-free approach which performs adversarial guidance through images by selectively pushing apart matching visual features between reference and generated images during the reverse diffusion process. By simply adjusting the used reference, NegToMe enables a diverse range of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection

MethodsDiffusion