TL;DR
QuantVSR introduces a low-bit quantization approach for real-world video super-resolution, utilizing a spatio-temporal complexity aware mechanism and bias alignment to maintain high performance while reducing resource consumption.
Contribution
The paper presents a novel quantization method with a complexity-aware layer-specific rank allocation and learnable bias alignment for VSR models.
Findings
Achieves comparable performance to full-precision models.
Significantly outperforms recent low-bit quantization methods.
Effective on both synthetic and real-world datasets.
Abstract
Diffusion models have shown superior performance in real-world video super-resolution (VSR). However, the slow processing speeds and heavy resource consumption of diffusion models hinder their practical application and deployment. Quantization offers a potential solution for compressing the VSR model. Nevertheless, quantizing VSR models is challenging due to their temporal characteristics and high fidelity requirements. To address these issues, we propose QuantVSR, a low-bit quantization model for real-world VSR. We propose a spatio-temporal complexity aware (STCA) mechanism, where we first utilize the calibration dataset to measure both spatial and temporal complexities for each layer. Based on these statistics, we allocate layer-specific ranks to the low-rank full-precision (FP) auxiliary branch. Subsequently, we jointly refine the FP and low-bit branches to achieve simultaneous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
