CompSRT: Quantization and Pruning for Image Super Resolution Transformers
Dorsa Zeinali, Hailing Wang, Yitian Zhang, Yun Fu

TL;DR
This paper introduces CompSRT, a novel compression method for image super resolution transformers that combines Hadamard-based quantization and pruning, significantly improving performance and reducing model size.
Contribution
The paper presents a new compression technique for SR transformers that leverages Hadamard transforms and scalar decomposition, outperforming existing methods in accuracy and efficiency.
Findings
Achieves up to 1.53 dB improvement over SOTA
Reduces blurriness in images at various bitwidths
Prunes 40% of weights with minimal performance loss
Abstract
Model compression has become an important tool for making image super resolution models more efficient. However, the gap between the best compressed models and the full precision model still remains large and a need for deeper understanding of compression theory on more performant models remains. Prior research on quantization of LLMs has shown that Hadamard transformations lead to weights and activations with reduced outliers, which leads to improved performance. We argue that while the Hadamard transform does reduce the effect of outliers, an empirical analysis on how the transform functions remains needed. By studying the distributions of weights and activations of SwinIR-light, we show with statistical analysis that lower errors is caused by the Hadamard transforms ability to reduce the ranges, and increase the proportion of values around . Based on these findings, we introduce…
Peer Reviews
Decision·Submitted to ICLR 2026
* The paper targets an interesting direction.
The paper lacks a sufficiently comprehensive literature review and contains several inaccurate claims: * At Line 73, the authors state that FlatQuant is a Hadamard-based method. This is incorrect — FlatQuant uses a learnable matrix instead of a fixed Hadamard transformation. * The success of Hadamard or rotation-based transformations has already been extensively discussed in prior works [a, b, c]. These studies attribute the benefit to incoherent processing, as analyzed in QuIP[a]. However, the
1) The paper provides consistent PSNR/SSIM improvements across five datasets and three scales at 2–4 bits. 2) The experimental evaluation is very detailed and the results are supported by various statistical tests. The authors demonstrated that the Hadamard transformations reduce quantization errors in matrices by reducing the ranges of the values, and concentrating values around 0. The paper goes beyond intuition, using paired tests and effect sizes to argue the Hadamard's benefit arises from
1) Because CondiQuant code is unavailable, qualitative results use 2DQuant; this limits the strength of the visual SOTA claim. A controlled, code‑level visual comparison with CondiQuant (if possible) or an alternative strong visual baseline would help. 2) The paper reports a per‑image overhead due to extra FP Hadamard operations. There is no end‑to‑end profiling (e.g., batch throughput, latency on edge hardware, kernel fusion feasibility). 3) The 1‑bit dense mask plus storing the pruned tensor
1. The paper provides a fairly comprehensive validation within SwinIR-light of the mechanism by which Hadamard transforms are effective. 2. By decoupling the quantization step size and zero-point into two learnable parameters, the method intuitively increases representational and optimization flexibility while remaining simple to implement and directly pluggable into existing PTQ pipelines. 3. An integrated practice combining pruning and quantization is presented.
1. The claim that “This operation has been said to flatten the matrices by distributing the magnitude of outliers and that is how the errors get reduced, but the exact mechanism has not been explored nor has the flatness or normality of distributions been tested.” is not fully accurate. QuaRot [A] demonstrates end-to-end 4-bit LLM inference using Hadamard-based rotations to mitigate outliers, with distributional visualizations supporting the “flattening” effect. SpinQuant [B] shows the mechanism
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Advanced Data Compression Techniques
