Local Intrinsic Dimension Unveils Hallucinations in Diffusion Models
Bartlomiej Sobieski, Matthew Tivnan, Dawid P{\l}udowski, Micha{\l} Jan W{\l}odarczyk, Pengfei Jin, Przemyslaw Biecek, Quanzheng Li

TL;DR
This paper introduces a new perspective on diffusion model hallucinations, identifying local intrinsic dimension as a key factor and proposing a corrective method called Intrinsic Quenching to improve structural consistency.
Contribution
It reveals local intrinsic dimension as the main driver of hallucinations and introduces Intrinsic Quenching, a novel method to reduce these hallucinations effectively.
Findings
Intrinsic Quenching outperforms existing hallucination reduction methods.
LID is identified as the primary driver of hallucinations.
IQ improves anatomical consistency in medical imaging tasks.
Abstract
Diffusion models are prone to generating structural hallucinations - samples that match the statistical properties of the training data yet defy underlying structural rules, resulting in anomalies like hands with more than five fingers. Recent research studied this failure mode from several viewpoints, offering partial explanations to their occurrence, such as mode interpolation. In this work, we propose a complementary perspective that treats hallucinations as instabilities on the model-induced manifold. We begin by showing that a hallucination filter based on such instabilities matches or exceeds the performance of the recently proposed temporal one. By tracing the source of these instabilities, we identify local intrinsic dimension (LID) as their primary driver and propose Intrinsic Quenching (IQ), a direct corrective mechanism that deflates it to alleviate hallucinations. IQ…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
