When normalization hallucinates: unseen risks in AI-powered whole slide image processing

Karel Moens; Matthew B. Blaschko; Tinne Tuytelaars; Bart Diricx; Jonas De Vylder; Mustafa Yousif

arXiv:2512.07426·cs.CV·December 9, 2025

When normalization hallucinates: unseen risks in AI-powered whole slide image processing

Karel Moens, Matthew B. Blaschko, Tinne Tuytelaars, Bart Diricx, Jonas De Vylder, Mustafa Yousif

PDF

Open Access

TL;DR

This paper highlights the risks of hallucinations in AI-based whole slide image normalization, demonstrating that current methods can produce artifacts that are hard to detect and may compromise clinical analysis, urging for better validation.

Contribution

The authors introduce a novel image comparison measure to automatically detect hallucinations and systematically evaluate normalization methods on real-world data, exposing their limitations.

Findings

01

Hallucinations are common in AI-normalized WSIs on real-world data.

02

Current evaluation metrics often overlook these hallucinations.

03

Proposed measure effectively detects artifacts not captured by traditional metrics.

Abstract

Whole slide image (WSI) normalization remains a vital preprocessing step in computational pathology. Increasingly driven by deep learning, these models learn to approximate data distributions from training examples. This often results in outputs that gravitate toward the average, potentially masking diagnostically important features. More critically, they can introduce hallucinated content, artifacts that appear realistic but are not present in the original tissue, posing a serious threat to downstream analysis. These hallucinations are nearly impossible to detect visually, and current evaluation practices often overlook them. In this work, we demonstrate that the risk of hallucinations is real and underappreciated. While many methods perform adequately on public datasets, we observe a concerning frequency of hallucinations when these same models are retrained and evaluated on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Cell Image Analysis Techniques · Digital Media Forensic Detection