TL;DR
This paper presents ResQu, a novel image super-resolution framework combining quaternion wavelet preprocessing with latent diffusion models to improve perceptual quality and structural fidelity.
Contribution
It introduces a quaternion wavelet- and time-aware encoder within diffusion models, enhancing conditioning and outperforming existing methods in super-resolution tasks.
Findings
Outperforms existing approaches in perceptual quality.
Achieves higher standard evaluation metrics.
Demonstrates effectiveness on domain-specific datasets.
Abstract
Image Super-Resolution is a fundamental problem in computer vision with broad applications spacing from medical imaging to satellite analysis. The ability to reconstruct high-resolution images from low-resolution inputs is crucial for enhancing downstream tasks such as object detection and segmentation. While deep learning has significantly advanced SR, achieving high-quality reconstructions with fine-grained details and realistic textures remains challenging, particularly at high upscaling factors. Recent approaches leveraging diffusion models have demonstrated promising results, yet they often struggle to balance perceptual quality with structural fidelity. In this work, we introduce ResQu a novel SR framework that integrates a quaternion wavelet preprocessing framework with latent diffusion models, incorporating a new quaternion wavelet- and time-aware encoder. Unlike prior methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
