HalluGen: Synthesizing Realistic and Controllable Hallucinations for Evaluating Image Restoration
Seunghoi Kim, Henry F. J. Tregidgo, Chen Jin, Matteo Figini, Daniel C. Alexander

TL;DR
HalluGen is a diffusion-based framework that synthesizes realistic, controllable hallucinations in images, enabling systematic evaluation and detection of hallucinations in safety-critical image restoration tasks like medical imaging.
Contribution
We introduce HalluGen, the first scalable method for synthesizing and evaluating realistic hallucinations in image restoration, along with a large annotated dataset for benchmarking.
Findings
HalluGen produces perceptually realistic hallucinations with controllable parameters.
The dataset enables systematic evaluation of hallucination detection methods.
Our proposed metric SHAFE improves sensitivity to hallucinations over traditional metrics.
Abstract
Generative models are prone to hallucinations: plausible but incorrect structures absent in the ground truth. This issue is problematic in image restoration for safety-critical domains such as medical imaging, industrial inspection, and remote sensing, where such errors undermine reliability and trust. For example, in low-field MRI, widely used in resource-limited settings, restoration models are essential for enhancing low-quality scans, yet hallucinations can lead to serious diagnostic errors. Progress has been hindered by a circular dependency: evaluating hallucinations requires labeled data, yet such labels are costly and subjective. We introduce HalluGen, a diffusion-based framework that synthesizes realistic hallucinations with controllable type, location, and severity, producing perceptually realistic but semantically incorrect outputs (segmentation IoU drops from 0.86 to 0.36).…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Image Processing Techniques · Hallucinations in medical conditions
