TL;DR
FaithEIR is a diffusion-based framework for extreme image rescaling that uses learnable reversible transformations and semantic priors to improve reconstruction fidelity and perceptual quality.
Contribution
It introduces a novel invertible transformation and adaptive detail prior to enhance extreme image rescaling performance.
Findings
Outperforms state-of-the-art methods in fidelity and perceptual quality.
Achieves superior results on extreme 16x rescaling tasks.
Provides a lightweight semantic embedder for improved conditioning.
Abstract
Most recent extreme rescaling methods struggle to preserve semantically consistent structures and produce realistic details, due to the severely ill-posed nature of low- to high-resolution mapping under scaling factors of or higher. To alleviate the above problems, we propose FaithEIR, a diffusion-based framework for extreme image rescaling. Inspired by singular value decomposition, we develop learnable reversible transformation that enables invertible downscaling and upscaling in the latent space. To compensate for information loss due to quantization, we propose an adaptive detail prior, a high-frequency dictionary that captures the empirical average of commonly occurring structures in the training data. Finally, we design a lightweight pixel semantic embedder to provide semantic conditioning for the pretrained diffusion model. We present extensive experimental results…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
