CompSRT: Quantization and Pruning for Image Super Resolution Transformers

Dorsa Zeinali; Hailing Wang; Yitian Zhang; Yun Fu

arXiv:2601.21069·eess.IV·February 6, 2026

CompSRT: Quantization and Pruning for Image Super Resolution Transformers

Dorsa Zeinali, Hailing Wang, Yitian Zhang, Yun Fu

PDF

Open Access 3 Reviews

TL;DR

This paper introduces CompSRT, a novel compression method for image super resolution transformers that combines Hadamard-based quantization and pruning, significantly improving performance and reducing model size.

Contribution

The paper presents a new compression technique for SR transformers that leverages Hadamard transforms and scalar decomposition, outperforming existing methods in accuracy and efficiency.

Findings

01

Achieves up to 1.53 dB improvement over SOTA

02

Reduces blurriness in images at various bitwidths

03

Prunes 40% of weights with minimal performance loss

Abstract

Model compression has become an important tool for making image super resolution models more efficient. However, the gap between the best compressed models and the full precision model still remains large and a need for deeper understanding of compression theory on more performant models remains. Prior research on quantization of LLMs has shown that Hadamard transformations lead to weights and activations with reduced outliers, which leads to improved performance. We argue that while the Hadamard transform does reduce the effect of outliers, an empirical analysis on how the transform functions remains needed. By studying the distributions of weights and activations of SwinIR-light, we show with statistical analysis that lower errors is caused by the Hadamard transforms ability to reduce the ranges, and increase the proportion of values around $0$ . Based on these findings, we introduce…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 2Confidence 5

Strengths

* The paper targets an interesting direction.

Weaknesses

The paper lacks a sufficiently comprehensive literature review and contains several inaccurate claims: * At Line 73, the authors state that FlatQuant is a Hadamard-based method. This is incorrect — FlatQuant uses a learnable matrix instead of a fixed Hadamard transformation. * The success of Hadamard or rotation-based transformations has already been extensively discussed in prior works [a, b, c]. These studies attribute the benefit to incoherent processing, as analyzed in QuIP[a]. However, the

Reviewer 02Rating 6Confidence 3

Strengths

1) The paper provides consistent PSNR/SSIM improvements across five datasets and three scales at 2–4 bits. 2) The experimental evaluation is very detailed and the results are supported by various statistical tests. The authors demonstrated that the Hadamard transformations reduce quantization errors in matrices by reducing the ranges of the values, and concentrating values around 0. The paper goes beyond intuition, using paired tests and effect sizes to argue the Hadamard's benefit arises from

Weaknesses

1) Because CondiQuant code is unavailable, qualitative results use 2DQuant; this limits the strength of the visual SOTA claim. A controlled, code‑level visual comparison with CondiQuant (if possible) or an alternative strong visual baseline would help. 2) The paper reports a per‑image overhead due to extra FP Hadamard operations. There is no end‑to‑end profiling (e.g., batch throughput, latency on edge hardware, kernel fusion feasibility). 3) The 1‑bit dense mask plus storing the pruned tensor

Reviewer 03Rating 4Confidence 4

Strengths

1. The paper provides a fairly comprehensive validation within SwinIR-light of the mechanism by which Hadamard transforms are effective. 2. By decoupling the quantization step size and zero-point into two learnable parameters, the method intuitively increases representational and optimization flexibility while remaining simple to implement and directly pluggable into existing PTQ pipelines. 3. An integrated practice combining pruning and quantization is presented.

Weaknesses

1. The claim that “This operation has been said to flatten the matrices by distributing the magnitude of outliers and that is how the errors get reduced, but the exact mechanism has not been explored nor has the flatness or normality of distributions been tested.” is not fully accurate. QuaRot [A] demonstrates end-to-end 4-bit LLM inference using Hadamard-based rotations to mitigate outliers, with distributional visualizations supporting the “flattening” effect. SpinQuant [B] shows the mechanism

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Image and Video Quality Assessment · Advanced Data Compression Techniques