Align & Invert: Solving Inverse Problems with Diffusion and Flow-based Models via Representation Alignment
Loukas Sfountouris, Giannis Daras, Paris Giampouras

TL;DR
This paper introduces REPA, a representation alignment method that enhances inverse problem solutions by aligning diffusion or flow models with pretrained encoders, leading to improved reconstruction quality and efficiency.
Contribution
We extend representation alignment to inverse problems, integrating it with diffusion and flow models and providing theoretical insights into its regularization effects.
Findings
REPA improves reconstruction quality across multiple inverse tasks.
Aligning model representations enhances perceptual realism.
REPA reduces the number of steps needed for high-quality reconstructions.
Abstract
Enforcing alignment between the internal representations of diffusion or flow-based generative models and those of pretrained self-supervised encoders has recently been shown to provide a powerful inductive bias, improving both convergence and sample quality. In this work, we extend this idea to inverse problems, where pretrained generative models are employed as priors. We propose applying representation alignment (REPA) between diffusion or flow-based models and a DINOv2 visual encoder, to guide the reconstruction process at inference time. Although ground-truth signals are unavailable in inverse problems, we empirically show that aligning model representations of approximate target features can substantially enhance reconstruction quality and perceptual realism. We provide theoretical results showing (a) that REPA regularization can be viewed as a variational approach for minimizing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · 3D Shape Modeling and Analysis · Face recognition and analysis
