The Art of Deception: Color Visual Illusions and Diffusion Models

Alex Gomez-Villa; Kai Wang; Alejandro C. Parraga; Bartlomiej; Twardowski; Jesus Malo; Javier Vazquez-Corral; Joost van de Weijer

arXiv:2412.10122·cs.CV·December 16, 2024

The Art of Deception: Color Visual Illusions and Diffusion Models

Alex Gomez-Villa, Kai Wang, Alejandro C. Parraga, Bartlomiej, Twardowski, Jesus Malo, Javier Vazquez-Corral, Joost van de Weijer

PDF

TL;DR

This paper investigates how diffusion models encode and generate visual illusions, revealing human-like perception shifts and demonstrating their ability to produce new illusions that can deceive both models and humans.

Contribution

It is the first to show that diffusion models encode visual illusions similarly to humans and can generate new illusions in realistic images using text-to-image models.

Findings

01

Diffusion models exhibit human-like brightness and color shifts in their latent space.

02

They can predict and generate visual illusions in images.

03

Model-generated illusions can also fool human observers.

Abstract

Visual illusions in humans arise when interpreting out-of-distribution stimuli: if the observer is adapted to certain statistics, perception of outliers deviates from reality. Recent studies have shown that artificial neural networks (ANNs) can also be deceived by visual illusions. This revelation raises profound questions about the nature of visual information. Why are two independent systems, both human brains and ANNs, susceptible to the same illusions? Should any ANN be capable of perceiving visual illusions? Are these perceptions a feature or a flaw? In this work, we study how visual illusions are encoded in diffusion models. Remarkably, we show that they present human-like brightness/color shifts in their latent space. We use this fact to demonstrate that diffusion models can predict visual illusions. Furthermore, we also show how to generate new unseen visual illusions in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDiffusion