Spectral Collapse in Diffusion Inversion
Nicolas Bourriez, Alexandre Verine, Auguste Genovesio

TL;DR
This paper identifies a spectral collapse problem in diffusion inversion for unpaired image translation, where spectral sparsity causes loss of detail, and proposes Orthogonal Variance Guidance to improve texture and structure preservation.
Contribution
The paper introduces Orthogonal Variance Guidance, a novel inference-time correction method that addresses spectral collapse in diffusion models for better image reconstruction.
Findings
OVG restores photorealistic textures effectively.
Spectral collapse causes oversmoothing in diffusion inversion.
OVG maintains structural fidelity while enhancing textures.
Abstract
Conditional diffusion inversion provides a powerful framework for unpaired image-to-image translation. However, we demonstrate through an extensive analysis that standard deterministic inversion (e.g. DDIM) fails when the source domain is spectrally sparse compared to the target domain (e.g., super-resolution, sketch-to-image). In these contexts, the recovered latent from the input does not follow the expected isotropic Gaussian distribution. Instead it exhibits a signal with lower frequencies, locking target sampling to oversmoothed and texture-poor generations. We term this phenomenon spectral collapse. We observe that stochastic alternatives attempting to restore the noise variance tend to break the semantic link to the input, leading to structural drift. To resolve this structure-texture trade-off, we propose Orthogonal Variance Guidance (OVG), an inference-time method that corrects…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Advanced Neuroimaging Techniques and Applications · Advanced Image Processing Techniques
