# Material fictions: Comparing physically based renderings and generative AI images through material perception

**Authors:** Yuguang Zhao, Jeroen Stumpel, Huib de Ridder, Jan Jaap R. van Assen, Maarten W. A. Wijntjes

PMC · DOI: 10.1167/jov.26.3.7 · Journal of Vision · 2026-03-16

## TL;DR

This study compares how generative AI and physically based rendering images are perceived by humans, exploring how they fit into a shared perceptual space for materials.

## Contribution

The study introduces a novel comparison of generative AI and PBR images in material perception using human similarity judgments.

## Key findings

- AI models generated images with two-dimensional perceptual spaces, differing from the one-dimensional MERL dataset.
- Stable Diffusion with depth-map constraints produced consistent perceptual spaces similar to MERL and other material studies.
- AI-generated images may serve as a new medium for exploring material perception.

## Abstract

Generative artificial intelligence (AI) models unlock new ways to create images, emerging as a new medium alongside paintings, photographs, physically based renderings (PBR), etc. Generative AI images can be perceptually convincing without being physically plausible, allowing to investigate the boundaries of visual perception. This study examines whether generative AI images adhere to a medium-independent perceptual space converged from previous studies. We compared the perceptual similarity of images from three generative AI models against a bidirectional reflectance distribution functions (BRDFs) PBR image dataset, using human similarity judgments. In experiment 1, we used the text descriptions of 32 materials (e.g., blue acrylic) from the Mitsubishi Electric Research Laboratories (MERL) BRDF dataset, prompting two text-to-image models, DALL-E 2 and Midjourney v2, to generate 32 sphere-shaped stimuli per model. Perceptual spaces derived from similarity judgments revealed that both AI models resulted in two-dimensional spaces whereas the MERL space was confined to one dimension, probably owing to a lack of surface texture. These unrelated perceptual spaces suggest the AI models generated unique and different images from identical text prompts. In experiment 2 we used the text-to-image model Stable Diffusion v1.5 with ControlNet for additional depth-map constraints. Using the same 32 descriptions, we generated 3 sets using 3 different depth maps. The three resulting perceptual spaces are all two-dimensional, exhibiting high similarity, indicating a robust and non-random structure. They also show a similar structure to the MERL space and perceptual spaces from other material studies using photographs, PBR, and depictions, suggesting AI-generated imagery may indeed be used as a new medium to explore material perception.

## Full-text entities

- **Diseases:** AI (MESH:C538142), MERL (MESH:D007757), cherry (MESH:D009081)
- **Chemicals:** AlexNet (-), charcoal (MESH:D002606), aluminium (MESH:D000535)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** DALL-E 2 — Mus musculus (Mouse), Carcinoma of the mouse prostate gland, Cancer cell line (CVCL_S003)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13001837/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13001837/full.md

## References

58 references — full list in the complete paper: https://tomesphere.com/paper/PMC13001837/full.md

---
Source: https://tomesphere.com/paper/PMC13001837